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DNA POLYMERASES WITH ENHANCED THERMOSTABILITY AND ENHANCED 
LENGTH AND EFFICIENCY OF PRIMER EXTENSION 

BACKGROU ND OF TKF INVENTION 

The present invention is directed to DNA polymerases, and more particularly, to a 
novel mutation of Thermus aguaticus and Thermus flavus DNA polymerases exhibiting enhanced 
thermostability over any form f>f these enzymes now known. The invention is also directed to recom- 
binant DNA sequences encoding such DNA polymerases, and vector plasmids and host cells suitable 
for the expression of these recombinant DNA sequences. Hie invention is also directed to a novel 
formulation of the DNA polymerases of the present invention and other DNA polymerases, which 
formulation of enzymes is capable of efficiently catalyzing the amplification by PCR (the polymerase 
chain reaction) of unusually long and faithful products. 

DNA polymerase obtained from the hot springs bacterium Thermus aguaticus (Taq 
DNA polymerase) has been demonstrated to be quite useful in amplification of DNA, in DNA 
sequencing, and in related DNA primer extension techniques because it is thermostable. Thermostable 
is defined herein as having the ability to withstand temperatures up to 95<>C for many minutes without 
becoming irreversibly denatured, and the ability to polymerize DNA at high temperatures (60° to 75° 
C). The DNA and amino acid sequences described by Lawyer etal. . J. Biol. Chem. 264:6427 
(1989), GcnBank Accession No. J04639, define the gene encoding Thermus aouaticus DNA 
polymerase and the enzyme Thermus aouaticus DNA polymerase as those terms are used in this 
application. The highly similar DNA polymerase (Tfl DNA polymerase) expressed by the closely 
related bacterium Thermus flavus is defined by the DNA and amino acid sequences described by 
Akhmetzjanov, A. A., and Vakhitov, V.A. (1992) Nucleic Acids Research 20:5839, GenBank 
Accession No. X66105. These enzymes are representative of a family of DNA polymerases, also 
md uding Thermus thermophilic DNA polymerase, which are thermostable. These enzymes lack a 3*- 
exonuclease activity such as that which is effective for editing purposes in DNA polymerases such as 
E i _cpH DNA polymerase I, and phages T7, T3, and T4 DNA polymerases. 

Gelfand CQL, U.S. Patent 4,889,818 describe a wild-type (abbreviation used here: 
WT >» native Tnennus aguaticus DNA polymerase. Gelfand etaL, U.S. Patent 5,079,352 describe a 
recombinant DNA sequence which encodes a mutein of Thermus aouaticus DNA polymerase from 
which the N-terminal 289 amino acids of Thermus aguaticus DNA polymerase have been deleted 
(claim 3 of 352, commercial name Stoffel Fragment, abbreviation used here: ST), and a recombinant 
DNA sequence which encodes a mutein of Thermus aouaticus DNA polymerase from which the 
N-terminal 3 amino acids of Thermus aouaticus DNA polymerase have been deleted (claim 4 of *352, 
trade name AmpliTaq, abbreviation used here: AT). Gelfand etal. report their muteins to be "fully 
active" in assays for DNA polymerase, but data as to their maximum thermostability is not presented. 



The development of other enzymatically active mutein derivatives of Thermus 
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DNA polymerase is hatred, however.' by the *M of the impact of any 
particular modification on .he structural and functional characteristics of the proteu, Many factors, 
ending potential disruption of critical bonding and folding patterns, must be considered in modifying 
an enzyme and the DNA for expression. A significant problem associated with the creation of N- 
fcnninal deletion muteins of ^-temperature Thermusj^ ™A polymerase is the prospect tha, 
the ^terminus of the new protein may become wildly disordered in the higher temperature 
„ ^g unfavorable interactions with the catalytic domains) of the protein, and resulting tn 
denamration. In fact, a few deletions have been constructed which appear to leave ft. identifiable 
domain for DNA poly— intact, yet none of these deletions have thennostabiHty at tenures as 
hi* as 99» C While Tlxm*^™* DNA polymerase has shown remarkable thermostab.hty a. 
much higher temperatures man mat exhibited by other DNA polymerases, it loses enzymatic activity 
when exposed to temperatures above 95-97oC. Moreover, its fidelity at 72o C (the recommended 
temperature for DNA synthesis) is limited to an effective error rate of approximately 1/9000 bp. 
Gelfand et^'s mutem ST of Thermus^!^ »NA F*— ^ " ***** ™ 
; deletion) is significantly more stable than AT, but ST exhibits significantly decreased activity when 
cycled to 98oC. and much less, if any. activity when cycled to 99o C , during the denaturation phase of 
PGR cycles. 

Amplification of DNA spans by me polymerase chain reaction (PGR) has become an 
imp ortant and widespread tool of genetic analysis since the introduction of thermostable Ta q DNA 
0 polymerase for its catalysis. Two limitations to prior art methods of PCR are the fidelity of the fin, 
product and the size of the product span ma, can be amplified. The fide.ity problem has been partly 
addressed by the rep.acemen. of Taq DNA polymerase by Pfu (Pyococcus furiosus) DNA 
polymerase, which exhibits an integral 3'-(edUing) exonuclease that apparently reduces the 
mutations/bp/cycle from about 10^ to about 10 However, experiments suggest mat this enzyme ,s 
* unable to amplify certain DNA sequence, in the size range ofl.5 to 2 kb that Klentaq-27* (also 

known as Klentaql) (N-terminal deletion of Taq DNA polymerase; WMB unpublished) or AmpUTaq 
(full-length Taq DNA polymerase; ref. 3) can amplify handily, and tha, Pfu is no more able (i.e. no, 
able) to amplify DNA ptoduct spans in excess of 5-7 kb than is any form of Taq DNA polymerase. 
For fuU-lengm Taq DNA Polymerase and for N-terminally truncated variants Klentaq-278. K.e»taq5 
30 and Stoffe, Fragment. PCR amplification apparently rapid.y becomes inefficient or non-existent as *e 
length of the targe, span exceeds 5-6 kb. This was shown even when 30 minutes was used dunng the 

extension step of each cycle. 

Although there are several reports of inefficient but detectable amplification at 9-10 
kbu^eUengmandoneatlSkb.mostgeneralwhcationsareBmitedtoSkb. 
35 Kainze etjL (Analytical Biochem. 202:46^9(1992)) report a PCR amplication of 

over 10 kb- a 10.9 kb and a !5.6 kb product, utilizing an enzyme of unpublished biological source 
(comn^ially available as "Hot Tub- DNA polymerase). Kainz* report achieving a barely 
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visible band at 15.6 kb after 30 cycles, starting with 1 ng of X DNA template per 100 ul of reaction 
volume. Hie efficiency of this amplification was shown to be relatively low, although a quantitative 
calculation of the efficiency was not presented. Attempts by Kainze et al. to make WT Thermus 
aquaticus DNA polymerase perform in the 10-15 kb size range were not successful, nor have 
successful results been reported by anyone else for any form of T^ermus^guaticus DNA polymerase 
in this size range. There is no report of any longer DNA products amplifiable by PCR. 

A DNA polymerase which retains its thermostability at 98° or 99©c would allow 
more efficient and convenient DNA analysis in several situations including 'colony PCR" (see Figure 
5), and/or allow thermal cycler overshoot without inactivation of the enzyme activity. A thermostable 
DNA polymerase or DNA polymerase formulation which exhibits improved fidelity relative to AT or 
WT Thermus aquaticus DNA polymerase at optimum temperatures for synthesis would be highly 
desirable for applications in which the target and product DNA is to be expressed rather than merely 
detected. 

A DNA polymerase formulation capable of efficient amplification of DNA spans in 
15 excess of 6 kb would significantly expand the scope of applications of PCR. For instance, whole 

plasmids, and constructs the size of whole plasmids, could be prepared with this method, which would 
be especially valuable in cases in which a portion of the DNA in question is toxic or incompatible with 
plasmid replication when introduced into E. coli. If this thermostable DNA polymerase preparation 
simultaneously conferred increased fidelity to the PCR amplification, the resulting large products 
would be much more accurate, active and/or valuable in research and applications, especially in 
situations involving expression of the amplifed sequence. If the thermostable DNA polymerase 
preparation allowed, in addition, more highly concentrated yields of pure product, this would enhance 
the method of PCR to the point where it could be used more effectively to replace plasmid replication 
as a means to produce desired DNA fragments in quantity. 

25 SUMMARY OF THE INVENTION 

Among the several objects of the invention, therefore, may be noted the provision of 
a DNA polymerase which can survive meaningful repeated exposure to temperatures of 99oC; the 
provision of a highly thermostable DNA polymerase which exhibits greater fidelity than Thermus 
aguaticus DNA polymerase when utilized at standard Thermus aquaticus DNA polymerase extension 
reaction temperatures; the provision of such a DNA polymerase which is useful for PCR amplification 
techniques from DNA templates and from single colonies of E. coli . single-stranded (linear) 
amphfication of DNA, nucleic acid sequencing, DNA restriction digest filling, DNA labelling, in vivo 
footprinting, and primer-directed mutagenesis. Further objects of the invention include the provision 
of recombinant DNA sequences, vectors and host cells which provide for the expression of such DNA 
35 polymerase. 

Additional objects include the provision of a formulation of DNA polymerase capable 
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of efficiently catalyzing primer extension products of greater length than permitted by conventional 
formulations, including lengths up to at least 35 kilobascs, that reduces the mutagenicity generated by 
the PCR process, particularly in comparison with prior art DNA polymerases and for any target 
lengths, that maximizes the yield of PGR target fragments and, concor :antly, enhances the intensity 
and sharpness of PCR product bands, without significant sacrifice in f; ;>iblity, specificity, and 
efficiency; and the provision of an improved process for amplification by PCR which can be utilized 
to reliably synthesize nucleic acid sequences of greater length and which can ,~: -lively utilize PCR 
products as primers. 

Briefly, therefore, the present invention is directed to a novel, recombinant DNA 
sequence encoding a DNA polymerase having an amino acid sequence comprising substantially the 
same amino acid sequence as the Thermus aquations or hennas flavus DNA polymerase, excluding 
however the N-terminal 280 amino acid residues of W Thcrmus acuaticus or the N-terminal 279 
amino acids nf^rmus flavus DNA polymerase. 

Additionally, the present invention is directed to a DNA polymerase having an amino 
acid sequence comprising substantially the same amino acid sequence of the Thcrmus aquaticus or 
msmLS i m s DNA polymerase, but lacking the N-terminal 280 amino acid residues of Ihermus 
, quati cus DNA polymerase, or the N-terminal 279 amino acids of T^ermus flavus DNA polymerase. 

In a further embodiment, the present invention is directed to a novel formulation of 
thermostable DNA polymerases, including a majority component comprised of at least one 
thermostable DNA polymerase lacking 3'^onuclease activity and a minority component comprised of 
at least one thermostable DNA polymerase exhibiting a 3Mediting) exonuclease activity. 

The present invention is also directed to a formulation of DNA polymerase which 
includes at least one DNA polymerase which in wild-type form exhibits 3'^xonuclease activity and 
which is capable of catalyzing a temperature cycle type polymerase chain reaction, wherein the 3"- 
exonuclease activity of said at least one DNA polymerase has been reduced to between about 0.2% 
and about 7%. of the 3'^conuclease activity of the at least one DNA polymerase in its wild-type 
form. 

In another aspect, a formulation of DNA polymerase comprising El and E2 is 
provided. El is one or more DNA polymerases which lack any significant 3 Exonuclease activity, 
and E2 is one or more DNA polymerases which exhibit significant 3 '^nuclease activity. The 
mixture provided has a relative DNA polymerase unit ratio of El to E2 of at least about 4: 1. 

The invention is further directed to an improvement in a process for amplification of 
nucleic acid sequences by PCR wherein the improvement comprises formulating DNA polymerase of 
the types described above. The formulation thereby created is used to catalyze primer extension 
during the PCR process, thus extending the applicable size range for efficient PCR amplification. 

DNA polymerases such as those discussed in this application are commonly 
composed of up to three identifiable and separable domains of enzymatic activity, in the physical order 
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from N -terminal to C-terminal, of S'-exonuclease, 3*-exonuclease, DNA polymerase. Taq DNA 
polymerase has never had a 3'-exomicIease, but the invention in the first part is directed to a deletion 
of its 5*-«xonuclease. Other DNA polymerases mentioned, such as Pfii DNA polymerase, do not have 
the 5*-exonuclease, but their S'-exonucIease function is central to the aspect of the invention directed 
to mixtures of DNA polymerases El (lacking significant 3'-exonuclease) and E2 (having 3'- 
exonuclease). In these mixtures, the presence of 5'-«xonuclease in either El or E2 has not been 
shown to be essential to the primary advantages of the present invention. 

Other objects and features will be in part apparent and in part pointed out hereinafter. 
SUMMARY OF ABBREVIATIONS 

The listed abbreviations, as used herein, are defined as follows: 
Abbreviations : 

bp = base pairs 

kb = kilobase; 1000 base pairs 

nt = nucleotides 

15 BME = beta-mercaptoethanol 

PPj = sodium pyrophosphate 

Pro = Pvrococcus furiosus 

Taq = Thermus aquaticus 

Tfl = Thermus flavus 

20 Tli - Thermococcus literalis 

Klentaq-nnn = N-terminally deleted Thermus aquaticus DNA polymerase that starts 
with codon nnn + 1, although that start codon and the next codon may not match the WT sequence 
because of alterations to the DNA sequence to produce a convenient restriction site. 

WT = wild-type (full length) or deletion of only 3 aa 
25 aa = amino acid(s) 

ST = Stoffel fragment, an N-terminal deletion of Thermus aquaticus DNA 

polymerase that could be named Klentaq-288. -LA = Long and 

Accurate; an unbalanced mixture of two DNA polymerases, at least one lacking significant 3*- 
exonuclease activity and at least one exhibiting significant 3*^exonuclease activity. 
30 PCR = (noun) 1. The Polymerase Chain Reaction 

2. One such reaction/amplification experiment. 3. (verb) To amplify via the 
polymerase chain reaction. 

ul = microliters) 

r 

ATCC = American Type Culture Collection 
35 Megaprimer = double-stranded DNA PCR product used as 

primer in a subsequent PCR stage of a multi-step procedure. 
Deep Vent = DNA polymerase from Pyrococcus species GB-D; purified enzyme is 
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available from New England Biolabs. 

Deep Vent exo- = mutant form of Deep Vent DNA polymerase lacking 3 '(editing)- 

exonuclease. 

Vent = DNA polymerase from Thermococcus litoralis; purified enzyme is available 

from New England Biolabs. 

Vent exo- = mutant form of Vent DNA polymerase lacking 3\editing)-exonuclease. 



Pfu = DNA polymerase from Pyrococcus furiosus lacking 3' (editing) -exonuclease; 
purified enzyme is available from Stratagene Cloning Systems, Inc. 

Pfu exo- = mutant form of Pfu DNA polymerase ; purified enzyme is available from 

10 Stratagene Cloning Systems, Inc. 

Sequenase = A chemically modified or a mutated form of phage T7 or T3 DNA 
polymerase wherein the modification or mutation eliminates the 3 '-exonuclease activity. 
RRTPF DESCRIPTION O F THE DRAWINGS 

Figure 1 depicts the nucleotide sequence of primers that can be used for amplification 
15 of the gene for a preferred embodiment of the DNA polymerase of this invention. The bulk of the 
DNA sequence for the gene (between the primers) and the resultant amino acid sequence of the 
enzyme, is defined by the indicated GenBank entry. 

Figure 2 depicts the nucleotide sequence of the same primers as in Figure 1, and 
shows that these same primers can be used for amplification of the analogous gene from Thermus 
20 flavus . 

Figure 3 is a photograph of an agarose gel depicting a PCR amplification reaction 
conducted using the prior art enzyme Thermus aquaticus DNA polymerase (AmpliTaq; AT) and a 
preferred embodiment of the DNA polymerase of this invention, tested with differing peak dehatur- 
ation temperatures lasting a full 2 min each for 20 cycles. Full activity at 98° and partial but useful 
25 activity at 99° is exhibited by the preferred embodiment of this invention, whilst AT is unable to 

withstand these temperatures. Figure 3 demonstrates that the enzyme of the present invention is more 
thermostable than AT in a practical test - PCR amplification. 

Figure 4 is a photograph of an agarose gel depicting a PCR amplification reaction 
conducted using 4 enzymes: the prior art enzyme Thermus aquations DNA polymerase (AmpliTaq; 
30 AT); the DNA polymerase of this invention (KlenTaq-278); the prior art enzyme AmpliTaq Stoffel 
Fragment (SD; and KlenTaq-291. All were tested with PCR denaturation steps carried out at 95«C 
(control standard temperature), and at 98<>C. All were tested at two levels of enzyme, the lower level 
being as close as practicable to the minimum necessary to support the reaction at the control 
temperature. 

35 N ote that both KlenTaq-291 and ST behave identically, losing most, but not all, of 

their activity when used at 98* C, yet KlenTaq-278 is at least twice as able to withstand use of the 
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higher deaatunuion tenmerature. AT is seen to be drastically reduced in effectiveness by exposure to 
98' C. Tne behaviour of these enzymes is reproducible except for ST, which is at its best in the 
presented experiment, but performs more poorly when used in the amounts recommended by the 
manufacturer. 

Figure 5 is a photograph of an agarose gel analysis of the products of colony PCR 
carried out at the standard peak denaturation temperature of 95° C compared to the newly available 
temperature of 98° C allowed^ the enzyme of the present invention, figure 5 demonstrates an 
application advantage of the use of the newly available peak denaturation temperature. 

Figure 6(A-Q is three photographs, each of an agarose gel on which was loaded a 
portion test PCR experiment, figure 6 demonstrates the large increase in efficiency of large 
DNA span PCR achieved by variations of a preferred embodiment of the enzyme formulation of the 
invention. Although KlenTaq-278 or Pfu DNA polymerase, alone, are shown to catalyze a low level 
of 6.6 kb PGR product formation, various'combinations of the two are seen to be much more efficient 
Wr and lower amounts of Pfu in combination with Klentaq-278 are seen to be effective, down to 
the minimum presented. 1/640. Of those shown, only a combination of Klentaq-278 and Pfu car, 
catalyze efficient amplification of 6.6 kb. Per 100 ul, the indicated level of each enzyme (see 
Methods, example 8. for unit concentrations) was used to catalyze PCR reactions templated with 19 D g 
XplacS DNA and-primers MBL and MBR. 20 cycles of 94' 2 min., 60' 2 min., 72« 10 rain. 

Figure 7 is a photograph of an agarose gel on which were analyzed the products of 
PCR experiments to test the performance of an embodiment of the invention in catalyzing the 
amplification of fragments even longer than 6.6 kb. Figure 7 demonstrates the ability to amplify 8 4 
kb, 12.5 kb, 15 kb. and 18 kb with high efficiency and large yield, utiliziing the 1/640 ratio 
embodiment of the enzyme formulation of the invention. Target product size is indicated above each 
lane as kb:. I*vel of template per KX) ul is indictfed as ng X:. 20 or 30 cycles of PCR were each 2 
sec. 94«, 11 min. 70'. These early amplifications were non^ptimal in several respects compared to 
the current optimal procedure (see Methods, example 8): thick-walled tubes were employed instead of 
thin, catalysis was by 1 ul KlentaqLA-64 (63:l::Klentaq-278:Pfu) instead of KlentaqLA-16. the 27mer 
primers were used (see Table 1) instead of longer primers, the extension/atmealling temperature was 
70' instead of 68', and the Omnigene thermal cycler was used. 

Figure 8 is a bar graph enumerating the differences in the number of mutations 
introduced into a PCR product, the lacZ gene, by the near fidl-length prior an -3 deletion ofThermus 
aouaticus DNA polymerase, compared to the number of mutations introduced by Klentaq-278. 

Rgure 9 is a photograph of an agarose gel depicting a PCR amplification attempted 
using a 384 bp megaprimer (double-stranded PGR product) paired with a 43-mer oligonucleotide 
primer BtV5. Per 100 ul of reaction volume, the following enzymes (see Ex. 8. Methods for unit 
concentrations) were used to catalyze ampUfications: lane 1. 1 ul Pfu DNA polymerase; lane 2. 1/16 
ul Hu; lane 3. 1 ^ Klentaq-278; lane 4. both enzymes together (1 ul Klentaq-278 + 1/16 ul Pfu). 
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Klentao-278 XH3 = lambda DNA digested with HindHI. The on* successful amphficaaon resulted 
ta. the combination of the two enzymes (Une 4). Vent DNA polymerase could substitute for Pfu 
with the same result (data not shown). 

Rgure 10 is a photograph of an agarose gel demonstrating that 33mers are better 
.nan 27mers. Per 100 ul of reaction volume. 2 ng Canes 1-0 or 10 ng Oanes 7-12) of lambda 

(lan es «. 10-12). Besides U longer, the 33mer lambda primer sequences wee ^ «0*» 

h the — of 1.2. 1.4. and i.6 u» was used to catalyze the ampUficatrous of 12.5. 15. « 18 kb. 
lively. 15 ul aliquots (equivalent to 0.3 or 1.5 ng of X template) were analyzed by 0.8% 

agarose electrophoresis. 

Figure U is a photograph of an agarose gel showing a CHEF pulse-field analys.s 
(ref ii 4sw . S w i ^gtime)oflargePCRprc4ucUamplifiedbyKlenu qL A-16(1.2ul)under 
conditions which were suboptimal with respect to P H (unmodified PC2 buffer was used) and thermal 
cycler (Omnigene). Starting template (see Table 0 was at 0, ng/ul and the time at 68' - each cycle 
was 21 min. for products over 20 kb, 13 min. for lanes 4 * 5. and U min. for lanes 1 1-14. The 
volum es of PGR reaction product loaded were adjusted to result in approximately ^ , « - 
ul - 12 12,4.2; i0.10.10; 2.2.4.!. The standard size lanes (S) show full-length Xplac5 DNA (48645 
* ^mixedwitbaHindmdigestofXDNA. As for Tab.e 1. the sizes in 5 figures are in base p... as 
predicted from the primer positions on the sequence of X P lac5 DNA. and si.es w„h decrma, pomts are 
in kb, as determined from this gel. 

Figure 12 is a photograph of an agarose gel depicting 28 kb and 35 kb products 

product was cycled with 24 min. extension times, bom in the RoboCycler at opttmum P H s* Ex. 8. 
Memoes). Lanes S (1.4.7) contain markers of undigested XplacS and Hmdm-digested *.ac5 DNA. 

Figure 13 is a photograph of an agarose gel showing the resuhs of a Pfu ex* mutant 
test PCRamplificationof8.4kbby 30 units (0.7 ug) of Klen.aq-278 alone 0-es 1.7)andm 

archaebacteria, Pfu wild type exo* DNA polymerase ( + ; hu.es 2.3) or a mutant ^f Uc^ng the 3 - 
exonuclease activity (-; Unes 4.5). Une 6 is the result if 1 (2.5 units) of so,ely Pfu DNA 
polymerase (wl. exo + ) is employed. 

r— ™™ ™ ™E PRE FFPFFO EMBODIMENT hPrRofthe 
35 ^ferrh^w to Rgure I, the primers and logic for amphficauon by PGR of the 

^binan, DNA sequence encoding a preferred embodiment of the DNA polymerase 



of the invention 
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(referred to herein as Klentaq-278). are set forth. As depicted in Figure 1, an initiator methionine and 
a glycme residue occupy the first two N-termina. positions of Klentaq-278. previous* occupied by 
residues 279 and 280 of WT 1^^^^ DNA polymerase, fol.owed by the amino acid 
sequence of wUd-type m SS! ^ aM ^ DNA po.ymerase, beginning with the amino acid residue a, 
position 281 as described by Uwyer The codons eroding amino acid residues 1 through 280 of 
B?rSMi ^ ° NA « th - fo - and the amino acids 1 thru 280 are not 

present m the resulting gene product. Another preferred embodiment of the DNA polymerase of the 
■nvention is depicted in Figure 2. In mis embodiment, me same deletion mutation described above is 
made to the highly analogous enzyme Thermus f| a y,,s DNA polymerase. 

It will be appreciated that minor variations incorporated into the DNA encoding for 
or the amino acid sequence as described herein, which retain substantially me amino acid sequence as 
set forth above, and which do not significantly affect the thermostability of the polymerase are 
included within the scope of the invention! 

Surprisingly, the mutant DNA polymerase Klentaq-278 exhibits thermostability at 
temperatures above those reported for any previous variant of 3 her JS us Jto DNA polymerase 
and has demonstrated a fidelity in final PGR products which is greater than that of WT Ihermus 
abacus DNA polymerase, when bom are utilized a, the 72' C temperatures recommended for DNA 
synthesis. Further, since Klentaq-278 does no, have the S'-exonucJease activity associated with 
Itemumtim DNA polymerase (removed as a consequence of the N-terminal deletion) it is 
s.gmficantiy superior to wild-^pe J^s^^ DNA polymerase for DNA sequencing 
Mutagenesis results, and mismatched primer testing, suggest mat Klentaq-278 is less processive and is 
less Idcely to extend a mispaired base man wi.d-type Ile^usjiaua^ DNA polymerase. 

Thermostability teste with Klentaq-278. Stoffel Fragment (ST, alternative designation 
Klentaq-288) and Klentaq-291 have been carried out. The test used involves 20 PCR cycles with a ' 
full 2 minutes each at me peak test temperature, such as 97' C. 98' C. or 99« C. and the intensity of 
the resulting amplified bands is compared to 2 minutes at 97' C. or a. a lower control denaturation 
temperature, such as 95 • C (at which al. of these variants are stab.e). These data indicate that ST and 
Klentaq-291 behave similarly, having thermolability at 98' C that is similar to each other ye. distinct 
from Klentaq-278. which exhibits little detectable thermolability a. 98' C in these tests. These data 
suggest that the number of N-terminal amino acids is important to the enhanced thermostability 
exh,b,ted by the DNA polymerase of the invention. Evidently deletions ST and Hentaq-291 are in a 
class which has removed too many amino acids (10 more and 13 more) for the optimum stability 
demonstrated by the invention Klentaq-278. 

The DNA polymerase from the bacterium sometimes designated ThsmusJIavus (and 
sometimes Thermus aquaticus - see ATCC catalog) is highly homologous to the WT Thermus 
aauaticu, DNA polymerase. In me region of the deletions being discussed here, the enzymes and 
genes are exactly homologous, and it is believed that the differences between the pair ST, Klentaq-291 
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• vwa 278 would remain if the analogous deletions were constructed. Indee.*. the 

S within the scope of this invention. 

The invention also features a vector which includes a recombinant DNA sequence 

excludes the N-terminal 280 amino acids of wild-type Jhennusjiguaticus DNA polymerase (see 

10 Lawyer et al., supra). 

h preferred embodiments, tie vector is that nucleic add present as plasnud 

pWB254b (SEQ ID NO:5) deposited as ATCC No. 69244 or a host cell containing such a vector. 

In a related aspect, the invention features a purified DNA polymerase having an 
^.Idsequenceasdiscussedabove. As used herein, 

polymer is at least 10* (w/w) of the p^in of a preparation. Even more preferably. ,t 
provided as a homogeneous preparation, e.g., a homogeneous solution. 

in general, the recombinant DNA sequence of the present invention » amphfied from 

20 polymerase gene which is larger than the desired span, using the po.ymerase cham react.on fl>CR, 
^ et al . Science 239:487. X988), employing primers such as those in Figure 2 into winch 
appropriate restriction sites have been incorporated for subsequent digesuon. 

The recombinant DNA sequence is then cloned into an expression vector using 

V ^crestrictionen.messuch.NcoXandHmdm. Then, afte, optional aBcalme phosphate 

of me target codons in place adjacent to desired contn.1 and expression sequences. The particular 
vector employed will depend in part on the type of host cel. chosen for use in gene expression. 

,. . ctflnce ma also containing suitable promoter and terminator sequences. 
30 tetracycline resistance, ana aiso conwmiuB r „_-«„. 

In a preferred procedure, the recombinant DNA expression sequence of the present 
invention is cloned into plasmid P WB253 (expresses KlenTaq-235 deposited as ATCC No. 68431) or 

University), . P BR322 derivative. The specific sequence of the resulting plasmid, designated 
frequently used as host cells for expression of DNA polymerase, although techniques for using more 
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complex cells are known. See, eg., procedures for using plant cells described by Depicker A e. 
-I.. LJ&SL6EELS&1 (1982) 1:561. I^oji host strain X7029, wild-type P. having deletion X 74 
covermg the Jae operon is utilized in a preferred embodiment of the present invention. 

A host cell is transformed using « protocol designed specifically for the particular 
host cell. For l^coji. a calcium treatment. Cohen, S.N., Proc. Natl. Acad. Sci. 69 2110 (1972) 
produces the transformation. Alternatively and more efficiently, electroporation of salt-free E . ^ is 
performed after the method oJDower etaK (1988). Nucleic Acids Research 16:6127-6145 After 
^formation, the transformed host, are selected from other bacteria based on characteristics acquired 
from the expression vector, such as ampicillin resistiu.ee, and then the transformed colonies of bacteria 
are further screened for the ability to give rise to high levels of isopropylthiogalactoside 
aPTO^ducedttermosUbleDNA polymerase activity. Colonies of transformed E coli are then 
grown in large quantity and expression of Klen<aq-278 DNA polymerase is induced for isolation and 
purification. 

Although a variety of purification techniques are known, all involve the steps of 
disruption of the E^oJJ cell, inactivation and removal of native proteins and precipiution of nucleic 
actds. The DNA polymerase is separated by taking advantage of such characteristics as its weigh, 
(centnfugation). size (dialysis, gel-filtration chromatography), or charge (ion^xchange chroma- 
tography). Generally, combinations of these techniques are employed together in the purification 
process. In a preferred process for purifying Klentaq-278 the Ecoji cells are weakened using 
lysozyme and 4c cel.s are lysed and neariy all native proteins are denatured by heating the cell 
suspension rapidly to 8 0 o C and incubating a, 80-81' C for 20 minutes. Th e suspension is then cooled 
and centrifuged to precipitate toe denatured proteins. nt supematant ^ ^ ^ ^ 
undergoes a high-sal, polyethy.ene-imine treatment to precipitate nucleic acids. Centrifugation of the 
extract removes toe nucleic acids. Chromatography, preferab.y on a heparin-agarose column, results in 
nearly pure enzyme. More detail of toe isolation is set forth below in Example 3. 

The novel DNA polymerase of the present invention may be used in any process for 
whtch such an enzyme may be advantageously employed. In particular, this enzyme is useful for PCR 
amplication techniques, nucleic acid sequencing, cycle sequencing, DNA restriction digest labelling 
and blunting, DNA labelling. ta yjvo DNA footorinting. and primer-directed mutagenesis. 
30 Amplification 

Polymerase chain reaction (PCR) is a method for rapidly amplifying specific 
segments of DNA, in geometric progression, up to a million fold or more. See, e.g MullisUS 

^ 4 ' 683 ' 202 ' Which iS ""T^ «"■* * ref — I** technique relies on repeated 
cycles of DNA polymerase^alyzed extension from a pair of primers with homology to toe 5' end 

andtotoecomplernentoftoea-endoftoeDNAsegn^ttobea^lified. A key step in toe process 
■s toe hea, denaturing of toe DNA primer extension products from their templates to permit another 
round of amplification. The operable temperature range for toe denaturing step generally ranges from 
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about 93oC * about 95<>C, which irreversibly denatures most DNA polymerases, necessitating the 
addition of more polymerase after each deration cycle. However, no additional DNA polymerase 
needs to be added if thermostable DNA polymerases such as DNA polymerase are 

used since they are able to retain their activity at temperatures which denature double-stranded nucletc 
acifc As described in Example 4. below. Kleu.aq-278 has demonstrated the ability to survive 
^gful repeated exposure to temperatures of 99oC. higher than for any previously known DNA 

polymerase. / , 

Klentaq-278 has also been demonstrated to have a higher fidelity than wld-type 

j^^^ DNA pdy-ner.se at 7 2 °C, the recornmeaded synthesis temperature. ^ data for 
.bis has been gathered by a method involving the PGR amplification of a lacZ DNA gene flanked by 
two selectable marker, [Barnes. W.M. (1992) Gene 112, 29-25]. Representative data companng the 
preferred embodiment of this invention Klentaq-278 to AT and another analogous N-termmal deletion. 
Klentaq-235. is shown in Figure 8. which demonstrates that different N-terminal deletions reproducb.y 
exhibit differing fidelities as measured in the final PCR product. 

Similar fidelity data for the enzyme ST is not available, since it is difficult for the 
commercial preparation of this enzyme to catalyze PCR of the long test fragment (4.8 kb) used for mis 
assay It is no, yet known whether the difficulty with ST for these experiments is caused merely by 
formulation (its concentration is less, such mat 10-15 times more volume is necessary for a 2 kb PCR 
amplification, and for these deletions more enzyme is needed for longer target DNAs), or whether ST 
, may be intrinsically unable to catalyze such a long-urge. PCR amplified. 
DNA Sequencing 

Particular DNA sequences may be elucidated by the Sanger Method (Sanger, F., 
Nicklen S. and Coulson. A.R., DNA sequencing with chain-terminating inhibitors. 
Proc Nat Acad.Sci.USA, 74 (1977) 5463-5467). using dideoxy analogs. DNA polymerases are used 
* in these methods to caUuyze the extension of the nucleic acid chains. However, in its natural form, 
Jte^ DNA polymerase (like many other polymerases) includes a domam for 
^nuclease activity. Tins associated exonuelease activity can. under certain conditions includmg 
the presence of a slight excess of enzyme or if excess incubation time is employed, remove 1 to 3 
nucleotides from the 5' end of the sequencing primer, causing each band in an alpha-labelled 
30 sequencing gel to appear more or less as a multiple.. If the label of the sequencing gel is 5". the 

exonuelease would not be able to cause multiples, but it would instead reduce the signal. As a result 
of the deletion of the N-termmal 280 amino acid residues .fB««« DNA polymerase, 
Uentaq-278 has « exonuelease activity and it avoids the sequencing hazards caused by S^xonuclease 

Activity 

, 5 Klentaq-278 can be used effectively in thermostable DNA polymerase DNA sequen- 

cmg. mere 85:9436, 1988) and 

original dideoxy (Sanger sLsL el *■> NaU - Acld " 
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fanis «jL describe , good procedure for dideoxy sequencing, but when the WT 

thee^ d^,,^^^^ Klentaq-278 is as effective in curing this 
problem as the subject of that patent application. Klentaq-235. a.k.a. DeltaTaq. 

The procedure recommended for orignal-type (non^ded, Innis eLah) dide0]ty 

wfcch ,s ap^ded to Uns apphcaUon (Appendix 1) and the ^e is who,,v in^ by ^ 

The procedure recommended for cycle sequencing is that in the USB Cycle 

^r^' ^ ^ " ^ *" ^ h SPPended '° ^ * -d the 

article is wholly incorporated by reference. 

Other Uses 

inese procedures are discussed below. 

Klentaq-278, especially in the formulation Klentaq-Ul (discussed below,, can be 
used ,o extend site-specif, mutagenic primes) which are am.eal.ed to single-stranded templates It 
for Klenow enzyme (the Urge fragment of E_coli DNA polymerase „ and T7 A poly- 
merase m thrs process, showing more primer selectivity a, SO^C than Klenow enzyme a, 37o C 1 
^ to completion or suffix, miration in n mins.. as compared to tiZ hour or l 
required for Klenow enzyme. 

Klentaq-278 has aJso been shown to be useful (and superior to wild-typefhermus 
« DNA po.ymerase) for the post-PCR .abelling steps with the third (second nl)^ 
J^^-W^^fc^ lam indebted to,. Ghana, Washingl 

I^-IO^T 8MOmiC SeqUenCiD8 to ^ f00tpriDtk «- Acad Sci U S A 8_9. 

K.e 0taq - 2 78isalso lB e fiu forDNAUbe U ing. For random primers, a .engm of a, 
I- 9 - ,s recon^nended. and preferably the reaction is wanned slowly (over 20-30 mins.) from 37 
• 65.C. Most preferably, a prognmm^ hea, b,o*. using procedures welHmowr, to those in this 
art, is utilized for the DNA labelling. 

Another use of Klentaq-278 is for the of Umbda DNA restriction digests 

, tr '~ ** ~* ^ *"* ^ ^ « resuU of induding Klentaq-278 and me 
four DNA dNm in wim a Hmdm digest performed a, 55o C . bands , and 4 are no, panialiy atuched 
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Strain pWB254b/X7029 was deposited with the American Type Culture Collection, 
Maryland, on Februa* 18, I9» and assigned the number ATCC 69244. Applicant ac^owledges his 
^nsibility to replace this culture ^11*1^^--^^^ — ^ 
5y l after the last ^uest for a culture, or 30 years, w^ 
t0 notify the depository of the 

av Jetothe P ubli, uithattimemede^itswmbemade^ 

, ». . „ f a7CFR Section 1-14 nad 35 U.S.C. §112. 
Patents under the terms of 37 c.r.i^. ^ uuu A 

^^^C of .he invent . Urge, leng* fe**. «o PGR ampl.ficaUon of 

DNA has been ioentmed and addressed. Concomi.anuy , ft. base pair fidelity, ft. to use PGR 
pllas primers. ^ .he^yie.dof^et^eo, were inereasea. 

activity (for example, Pfu. Ven, or Deep Vent). Surprisingly, .arget fragments of at ,eas, 35 

Moreover, products in .he range 6.6 to 8.4 kb can be effic^ amphfted by a 
formation of ftermosuble DNA polymerases consisting of a majority component comprised of a. 
2 one thermostable DNA polymerase lacking 3-onuclease activity and a mmonty -mponen, 
, comnrised of at least one ftermosUble DNA polymerase exhibiting ^nuclease actmty. 
3 C ° ffiPn Z prior at technology only allowed relatively inefficient and sporadic amplify 

of frag ments in this si, range, resulting in only relatively faint product bands or no detectabU proouct 
, ali in light of me curren. discovery. I believe I understand (wUhou, HmUtng myse.f to any 

« DNl polyme^ and » variants a. s,ow • extend a mismatched base pair (whic *ey «umo 
Zemey lack any ^nuclease). A couple of companies (New England Biolabs and 
"JTJSI — ^w^e^aS.editm.exonucleasew.ch 

unreliability of these enzymes for W 

degrades ft. primers so fta. UtUe or no PCR is possible. T* -tack problem . worsefcr 
rlan others It has been reported (Anonymous, The NEB Transcript, New England 

35 anneallmgcondiuons allow mat „ n H the S' 15 nt of fte primers must be 

only allow annexing at lower, non-selective temperatures, and me 5 15 n. o 

exactly homologous to the template. 
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I have discover that to effects of . 3 - <xonuclease ^ ^ ^ 

an unexpectedly minute presence of one or more DNA pdymeruses which exhibit significant (defined 
as betng b,ochemicaIly assayable) 3'^xonuclease activity (herein called 'E2') such as certain 
A*haebacteria. DNA polymerases, whilst efficient extension is being catalyzed by a Urge amount of 
one ormore DNA polymerases which lack any significant 3 '^nuclease activity, such as Klentao-278 
or AT (herein called "El '). As a minority component of a formulation or mixture of DNA 
po.ymerases, the unreliability apd inefficiency of the 3-.xonuc.ease DNA po.ymerase discussed 
above, is substantiaUy reduced or eliminated. Moreover, since it is beHeved mat the 3-.xonuc.ease is 
removing mismatches to diminate pausing at the mismatches, the resulting DNA exhibits fewer base 
pa* changes, whieh is a valuable decrease in the mutagenicity of PGR without sacrificing flexibiHty 
specfiety, and efficiency. In fact, the combination, eve. for KlenTaq-278/Pfu unite ratios as high as 
2000. exhibited great* increased efficiency of amplification. For most applications, the mixture of 
DNA polymerases must be at a retaive DNA polymerase unit ratio of El to E2 of a, leas, about 4- 1 
before enhanced product length and yield cat, be achieved. When Pfu DNA polymerase was used in' 
the formulation, the ratio preferab.y is in the range 80 to 1000 parts KlenTaq-278 per part (unit) Pfu 
more preferably from about ,50 to about ,70:1. and most preferab.y, is about ,60:1. depending ' 
somewhat on the pnmer-.emp.ate combination. Similar ratios are preferred for mixtures of Pfu and 
Klentaq-291. 

If Deep Ven, is substituted for Pfu for use in combination with Klentaq-278 or -29, 
the most preferred ratios for most app.ications increases to from about 450 to about 500-, El to E2- if 
Mi-length (WT, Taa or Amplitaa. is included as El, the most preferred ratio to Pfu or other E2 ' 
component is between about 10 and about ,5:1 of El to E2. 

E2 of the invention includes, but is not limited to. DNA polymerase encoded by 
genes from Pfu. Vent. Deep Vent. T7 coliphage. Tma. or a combination thereof. El of the invention 
mcludes. but is no, limited to. a mutant, 3-.xonuc.ease negative form of at, E2 DNA polymerase or 
-uternativdy, . DNA polymerase which, in unmutated form, does no. exhibit significant 3'- ' 
exonuclease activity, such as the DNA polymerases encoded by genes from Taa. Tfl. or Tth. or a 
combination thereof. 

As discussed below, the formulation of DNA polymerases of the present invention 
also indudes formulations of DNA polymerase wherein El comprises a reverse transcriptase such as 

Sequenase. 

Additional examples of the formulations of the present invention include mixtures 
wherem E, comprises or consists of a mutant or chemical modification of T7 or T3 DNA polymerase 
and E2 comprises or consists of a wild-Type T7 or T3 DNA polymerase, or. in another variation El 
composes or consists of a Vent DNA polymerase lacking any significant 3'^onudease activity (sold 
by New England Biolabs as Vent exo-) and E2 comprises or consists of Vent. 

The prindpal here discovered, namely the use of low levels of 3' exonuclease during 
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polymerase primer extension, including nonnal tempore incubations (i.e. using non-thermostabie 
DNA polymerases) and including reverse transcriptase enzymes, which are known to lack a y- 
(editing) «onuclease(Battula & Loeb, 1976). An example of the former is the use of Sequenase 
5 (exo-) as the majority enzyme, and wild-type 17 DNA polymerase (exo + ) or Klenow fragment as the 
ttinority component. An example of the latter is AMV (Avian Myoblasts Virus) or MLV (Murine 
^a Virus) Reverse Transcriptase as the major component, and Klenow fragment. T7 DNA 
' po lyn ,erase. or a thermostable DNA polymerase such as Pfu or Deep Vent as the mmor component. 
He of the lower activity of thermostable DNA polymerases a, the fcmpenuures of 37 degrees and 
10 4 2 degrees used by these reverse transcriptases, higher levels are likely to be require, man are use, ,n 
PGR Although Klenow fragment DNA polymerse is not a preferred DNA polymer using RNA as 
a template, it does function to recognize this template (Karkas, !973; Gulati. Kacian * Spiegelman, 
^.particutarlyinmeprcsenceofaddedMnion. Added Mn ion is routinely used to achieve 
^Ltription by thermostable DNA polymerase Tth. unfortuna^y (in me pnor art) without the 
benefit of an exo + component. It must be stressed mat for the use of the exo + ccmponen t for 

extra care must be taken to ensure that the exo+ component is entirely 
reverse transcriptase reactions, extra care mu« " 

free of contaminating RNAse. 

-The following references describe methods known in the art for using reverse 

transcriptases, and are hereby incorporated by reference. 

Battula N. Loeb LA. On the fidelity of DNA replication. Lack of 
exodeoxyribonuclease activity and correcting function in avian myelob.astosis virus DNA 
polymera s e . i^sm^^^' ISUWM, 1976 Feb 25. 

Gulati SC. Kacian DL. Spiegelman S. Conditions for using DNA poiymerase 1 as 

an RNA-dependent DNA polymerase. ^s^din^^ 



15 



25 w«nf America . 71(4) :1035-9, 1974 Apr. 

K^as jo. Reverse transcription by Escherichia coli DNA polymerase 1. 

Sd 13 s A . 70(12):3834-8, 1973 Dec. 

DNA Polymerase with no polymerase activity, only 3'-*xonuclease activity: 

While not limiting himself to a particular theory, applicant believes that the 

po.yme.se activity. In fact, it is further believed that this DNA polymerase activity is potentially 
Llesome. lading to unwanted synthesis or less accurate synthesis under conditio^ptmn^ for 
men^ority(Xl)DNApolyme^»mponent.nota.mino ri ,one. 

^ salas (1990) Site^irectod mutagenesis of the YCDTDS amino acid motif of the ph, 29 DNA 
35 polymcase. Gene who mutated me -Region V DNA conserved DNA polymerase mot, of 
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which has been sequenced by [Uemori.T., Isbiho.Y., Toh.H., Asada.F. and Kato.I. Organization and 
nucleotide sequence of the DNA polymerase gene from me archaeon Pyrococcus furiosus. Nucleic 
Acids Res. 21, 259-2*5 (1993)]. ' 

In a further embodiment of the present invention, a formulation of DNA polymerase 
is provided including or consisting of at least one DNA polymerase which, in wild-type form exhibits 
3Exonuc.ease activity and which is capab.e of catalyzing a temperature cycle type polymerase chain 
, reacuon in which the 3'-exonuclease activity of the one or more DNA polymerases discussed above 
has been reduced substantially, but not eliminated. The diminished 3 Exonuclease activity is obtained 
by mutation, or by chemical or other means known to those in this art. This formulation of DNA 
polymerase may then be used to catalyze primer extension in PCR amplification. Tbe Spanish 
researchers [Soengas. M.S.. Esteban. I.A., Uzaro, J.M. Bemad. S., Blasco. M. A...1 Salas. M and 
Blanco, L., Embo Journal 11:4227-4237 (1992)], studying a distantly related DNA polymerase of the 
alpha class which includes Pfu. Vent and Deep Vent DNA polymerases, also identified and 
demonstrated mutation of the 3 '-exonuclease domain as well separated and easily avoided while one is 
mutating the DNA polymerase motif(s). In this same report, they demonstrate that the exonuclease 
can be reduced to 4-7* activity by a change of the conserved Ty, residue to a Phe or a Cys, whilst a 
reduction to only 0.1* activity is obtained by replacing the conserved Asp with Ala. For long and 
accurate PCR, the optimum exo- (3 Exonuclease negative) mutation to create in a thermostable DNA 
polymerase such as Pfu. Vent or Deep Vent, is one that reduces but does not eliminate the 
exonuclease. for instance a reduction to 0.2-7%. preferably. 0.5-7*. and most preferably. 1-7* of 
the 3'-exonuclease activity of the wild-type DNA polymerase. 

As an alternative way to introduce the mutations (and as demonstrated for the Bt 
CryV gene example above) Klentaq-I^, the current invention, is used to introduce the changes into a 
small PCR product spanning the DNA sequence coding for the two homologies REGION 01 and 
REGION I. The changes are chosen to change conserved amino acids in REGION m and/or 
REGION I, using as a guide the conserved motifs displayed below with the aid of the MACAW 
computer program. 

I (data details not shown, but analogous to Example 1 above and readily carried out 
by one skilled in the art) have PCR-ampIified the ORF of the Pfu DNA polymerase gene from Pfu 
DNA. using the sequence published by Uemori et aj (1993) as a guide. Analogously to Example 2 
above. I have cloned this ORF into the same expression vector as used for expression of Hentaq-278, 
and I have shown that the DNA polymerase can be purified by the same procedure as described here 
for Klentaq-278 in Example 3 above. 

As an alternative way (alternative to Soengas et. al. suonrt to introduce the mutations 
(and as demonstrated for the Bt CryV gene example above) Wentaq-I^. the current invention, would 
be used to introduce the changes into a small PCR product spanning the DNA sequence coding for the 
two homologies REGION ID and REGION I. The changes would be chosen to change conserved 
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« ids in REGION m and/or REGION I. using as a guide the conserved motifs displayed beiow 
with the aid of the MACAW computer program. 

The following is numbered output from the computer program MACAW, 
demonstrating two of the DNA polymerase conserved motifs. A useful presentation of these and other 
DNA polymer motifs can aJso be found in Perler et d (1992) P.N.A.S. 89. 5577-5581. 

SEQ ID NO:30 REGION IJI 

Phi29 KlJvlU^SLYGkfasnpdvtgkvpylkengalgfrlg«etkdpvytpmgvfITA 435 



Pfu dyrqkaiKLLANSFYGyygyakarwyckecaes VTA 516 

10 SEQIDNO:31 



SEQlDNO:32 REGION I 

Phi29 WARyttitaaqacyd-^^ ^ 

15 Pf.* WGRkyielvwkclcckfgfKVLYIDTOGLYAT, P gg 576 
SEQ ID NO:33 
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The following examples illustrate the invention. 

EXAMPLE 1 

r^tmction of «" Fusible Gene for Klentaq-278 
In order to construct the Klentaq-278 DNA polymerase gene having a recombinant 
DNA sequence shown as the nucleotide sequence of Figure 1. the following procedure was followed. 

„ frnm o 25 «e of total TTiermiis aouaticus DNA using 

The mutated gene was amplified irom u.zd ug oi wuu * 

ft. polymerase d»in reaction (PCR. Sa*i * Science 239:487. ,988) primed by me .o synthetic 
DNA primers of Figure !. Primer KT1, SEQ ID NO:l, has homology to the wUMype DNA sta^g 
at codon 280; this primer is designed to incorporate a ft* * *° ^ amplified DNA. 
Primer Klenta^, SEQ ID NO:3. a 33mer spanning the stop codon on the other strand of .he 
wild-type gene encoding ™* P*— • - ■ - — 

double stop codon into the product DNA. ¥ „ « , « w M «ri 16 

Tta buffer for the PCR reaction was 20 mM Tris HC1 P H 8.55. 2.5 mM MgCl, 16 
mM (NHJtSO,, iSOug/mlBSA. and 200UM each dNlP- The cycle parameters were 2' 95°, 2' 

65 °' 5 ' 72 °' m order to minimize the mutations introduced by PCR (Saiki et al., su^). only .6 
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cycles of PCR were performed before phenol extraction, ethanol precipitation, and digestion with the 
restriction enzymes Ncol and HindlD. 

^ EXAMPLE 2 

Preparation of an Expression Vector 
The product Ncol and Hindffl fragment was cloned into plasmid pWB254b which 
had been digested with Ncol, fiindin, and calf intestine alkaline phosphatase. Hie backbone of this 
plasmid, previously designated pTAC2 and obtained from J. Majors, carries the following elements in 
counter-clockwise direction from the £vuH site of pBR322 (an apostrophe ' designates that the 
direction of expression is clockwise instead of counter clockwise): a partial lacZ' sequence, lacT, 
!acPUV5 (orientation not known), two copies of the tec promoter from PL Biochemicals 
Pharmacia-LKB; catalog no. 27-4883), the T7 gene 10 promoter and start codon modified to consist of 
a Ncol site, a Hindffl site, the trpA terminator (PL no. 27^884-01), an M13 origin of replication, and 
the ^mp* gene of pBR322. Expression of the cloned gene is expected to be induced by 0. 1 mM 
IPTG. 

Ampicillin-resistant colonies arising from the cloning were assayed by the single 
colony thermostable DNA polymerase assay of Sagner eiaT (1991) [GENE 97:119-23] and 4 strong 
positives were sized by the toothpick assay (Barnes, Science 195:393, 1977). One of these, number 
254.7, was of the expected size except for a small proportion of double insert. This plasmid was 
further purified by electroporation into E. ooli X7029 and screened for size by the toothpick assay, 
20 and one plasmid of the expected size with no double insert containination was designated pWB254b. 
This plasmid was used for the production of Klentaq-278 described herein. 
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EXAMPLE 3 
Purification of Large Amounts of Klentaq-278 
Plasmid pWB254 has a double (tandem repeat) tac promoter and the T7 gene 10 
25 leader sequence, an ATG start codon. a glycine codon and then codons 280-832 of Thermus aouaticii* 
DNA polymerase, then a tandem pair of stop codons followed by the trp transcription tenninator. The 
pBR322-based plasmid vector (pTac2 from John Majors) is ampicillin resistant. The cells are grown 
on very rich medium (see below). Bacterial host X7029 is wild-type P E. coli except for deletion 
X74 of the lac operon. 

30 Medium: Per liter water, 100 mg ticarciUin (added when cool), 10 g Y.E., 25 g. 

Tryptonc, 10 g. glucose, 1XM9 salts with no NaCl (42 mM NajPO,, 22 mM KHjPO.,19 mM NH 4 C1). 
Do not autoclave the glucose and the 10XM9 together, instead, autoclave one of them separately and 
mix in later. Adjust pH to 8 with 5 M NaOH (about 1 ml). Add IPTG to 0.1 mM at 

OD^ - 1 or 2, and shake well at 30° C. From OD = 2 up to 8 or 10, every half hour or so do the 

35 following: 
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1 . Read the pH with P H sticks 5-M>; Adjust to P H 8.5 with 5 M NaOH and swirling 
(2 to 5 ml per liter) whenever the pH falls below 8. 

2. Read and record the GD^ usually as a 1/10 or 1/50 dilution. 

3. This addition of glucose is optional and not necessarily of any value (evaluation of 
this question is incomplete at this time.) Read the glucose level with glucose sticks, and add an 
additional 0.5 % (10 ml of 50%) if the level falls below 0.2%. 

If it is late, (he cells can shake at 30° C all night after the last pH adjustment. 
Alternatively, set them in the cold room if they have not grown much in a few hours. 

Concentrate the cells e.g. by centrifugation in a GS3 rotor for 8 minutes at 8 krpm. 
Pour off the supernatant and add culture to spin more down onto the same pellets. 
Lysis: 

Resuspend the cells in milliliters of TMN buffer equal to twice the packed cell 
weight in grams: (50 mM Tris-HCl pH 855. 10 mM MgCl,, 16 mM (NH^OJ. 

To each 300 ml of cell suspension add 60 mg lysozyme and incubate the cells at 
5-10° C. with occasional swirling for 15 minutes. Then add NP40 or Triton X100 to 0. 1 %, and 
Tween 20 to 0. 1 % , by adding 1/100 volume of a solution of 10% in each. Then heat the cell 
suspension rapidly to 80° C. by swirling it in a boiling water bath, then maintain the cells (fast 
becoming an extract) at 80-81° C. for 20 minutes. Use a clean thermometer in the cells to measure 
temperature. Be sure the flask and bath are covered, so that even the lip of the flask gets the full heat 
treatment. After this treatment, which is expected to have inactivated all but a handful of enzymes, 
cool the extract to 37° C. or lower in an ice bath and add 2 ml of protease inhibitor (100 mM PMSF 
in isopropanol). From this point forward, try not to contact the preparation with any flask, stir bar, or 
other object or solution mat has no, been autoclaved. (Detergents and BME are not autoclavable. The 
PH and ammonium sulfate are also not autoclaved.) The purpose of the autoclaving is not only to 
avoid microbial contamination, but also to avoid contamination with DNA or nucleases. 

Distribute into centrifuge bottles and centrifuge at 2° C. (for instance, 30 minutes 
at 15 krpm in a Sorval SS-34 rotor or 14 h at 4 krpm in a GS3 rotor). The supernatant is designated 
fraction I, and can be assayed for DNA polymerase activity. 

High-salt PEI precipitation 

After rendering fraction 1 0.25 M in NaCI (add 14.6 g per titer), add five percent 
Polymin-P (PH, polyemylene-imine. Sigma) dropwise with stirring on ice to precipitate nucleic acids. 
To determine that adequate Polymin-P has been added, and to avoid addition of more than the 
rMnimum amount necessary, test 1/2 ml of centrifuged extract by adding a drop of Polymin-P, and 
only if more precipitate forms, add more Polymin-P to the bulk extract, mix and retest. Put tie tes, 
35 aliquots of extract back into the bulk without contaminating it. 

To confirm that enough PH has been added, centrifuge 3 ml and aliquot the 
supernatant into 1/2 ml aliquots. Add 0. 2. 4. 6 or 10 ul of 5% PH. Shake, let sit on ice. and 
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centrifuge in the cold. U>ad 15 ul of these aliquot supernatants onto an agarose gel containing ethidi- 
um bromide and electrophorese until the blue dye has travelled 2 cm. Inspect the gel on a UV i ighl 
box for detectable DNA or RNA in the supernatant. For the bulk extract, use about 1/100 volume 
(i.e. 2-3 ml for a 300 ml extract) excess 5% PH over the minimum necessary to remove all DNA by 
the agarose gel test. 

Stir in the cold for at least 15 minutes. Centrifugation of the extract then removes 
most of the nucleic acids. Keep the supernatant, avoiding any trace of the pellet. 

Dilute the PEI supernatant with KTA buffer until the conductivity is reduced to at or 
below the conductivity of KTA buffer with added 22 mM ammonium sulfate. (Check conductivity of 
1/40 dilution compared to similar dilution of genuine 22 mM A.S. in KTA.) Usually this is about a 5- 
fold dilution. 

Chromatography with Bio-Rex 70 (used by Joyce & Grindley) (Joyce, CM. & 
Grindley. N.D.E. (1983) Construction of a plasmid that overproduces the large proteolytic fragment 
(Klenow fragment) of DNA polymerase I of E^oli, Proc. Natl. Acad. Sci. U.S.A. 80. 1830-1834) is 
unsuccessful (no binding), but unavoidable, since without it, the next column (heparin agarose) will 
not work efficiently. We believe that the important function of the Bio-Rex 70 step is to remove all 
excess PH. although it is possible that some protein is removed as well. CM-cellulose does not 
substitute for Bio-Rex 70. 

Pass the diluted PH supernatant through equilibrated Bio-Rex 70 (10 ml per 100 g. 
cells). The polymerase activity flows through. Rinse the column with 2 column volumes of 22 mM 
A.S. / KTA. Our procedure is to set up the following heparin agarose column so that the effluent 
from the Bio-REX 70 column flows directly onto it. 

Heparin Agarose Chromatography (room temperature, but put fractions on ice as 

they come off.) 

Load the Bio-Rex flow-through slowly onto heparin agarose (Sigma; 10 ml per 100 
grams of cells [this could be too little heparin agarose].) Wash with several column volumes of KTA 
+ 22 mM A.S.. then three column volumes of KTA + 63* glycerol + 11 mM A.S., then elute the 
pure enzyme with KTA + 63% glycerol + 222 mM A.S. + 0.5* Thesit (this is more Thesit for the 
final eluate.) 

Pool the peak of polymerase activity or ODVCstarts about at 2/3 of one column 
volume after 222 mM starts, and is about 2 column volumes wide). Store pool at -20° C. 

The storage buffer is a hybrid of, and a slight variation of, AmpliTaq storage buffer 
as recommended by Perldn-Eimer Cetus and Taq storage buffer used by Boehimger-Mannheim: 50% 
glycerol (v/v; 63% w/v) t 222 mM aixunonium sulfate (diluted to about 50 mM for bench-strength sam- 
ples), 20 mM Tris-HCl pH 8.55,0.1 mM EDTA, 10 mM mercaptoethanol, 0.5% Tnesit). 

The "H"** 1 causes some thickening and cloudiness below -10° C. This seems to 
cause no harm, but we suggest you warm the enzyme to 0° C. on ice before aliquoting for use. 
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. - ca. T*;t™ vino 0 5% Twccn 20, which you may want to 

Thcsit replaces the combination of 0.5% Tnton-XlOO, U.:>* a ween j 

consider as an alternative. 

We have had sporadic reports that freezing can inactivate the enzyme. 
Exercise caution in this regard. This question is under current investigation. Storage at -80° 
(after quick-cooling with liquid nitrogen) is being tested and looks promising, but more than 
one freeze-thaw cycle has been deleterious to the enzyme preparation on some occasions. 

Our final yfcld of enzyme from 7 liters (100 g cells) was once 28 ml at a 
concentration of 120,000 units per ml (4 x bench-strength). 

1/4 ul of bench-strength enzyme will support the PCR of a 2 kb span of 
DNA in a 100 ul reaction. Template is 5-10 ng of plasmid DNA. Each cycle consists of 1 
^ 98' C 1 min 65* C, 6 min 72* C. Cycle number is 16-20. Uss enzyme is needed for 
smaller-sized products (1/8 ul for 500 bp) and more enzyme is needed for larger products (1 
ul for 5 kb). 

KTA Buffer P** liter 



20 mM Tris 8.55 10 ml of 2 M 

10 mM BME 07 °* neat 

10% w/v Glycerol 100 %• 

0.1 mM EDTA 0.2 ml of .5 M 

0.1% w/vThesit 10 ml of 10% 



Rour ooration A say 

1 x * (20 mM Tris-HCl P H 8.55, 2.5 mM MgCl 2 , 

M (NH4> 2 S0 4 , 100 ug/ml BSA) 
200-: - ml activated salmon sperm DNA 
25 40 uM each dNTP + 10-50 uCi a-*P-dATP per ml 

To 25 ul assay mix on ice add 0.2 ul of enzyme fraction, undiluted, or 
diluted in 8 ul of 1XPC2 buffer (or a 1/5 or 1/25 dilution thereof.) Prepare standard Kientaq 
or Amplit. r, zero enzyme and total input samples, also. Incubate 10 min. at 72° C. then 
chill. Spot 5 or 8 ul onto filter paper and wash twice for 5 - 10 min. with 5% TCA, 1% PP, 
30 If pieces of paper were used, count each using Cerenkov radiation or hand monitor. If a 
single piece of 3 MM paper was used, autoradiograph for 60'. 

PCR Assay to give 2 kb product. 

Make up 1 ml of PCR reaction containing 50 ng of plasmid pU (a clone of 
an R color control cDNA from maize. PNAS 86:7092; Science 247:449), 200 pmoles each of 
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primers Lc5 (SEQ ID NO:ll) and 1x3 (SEQ ID N0:12), PC2 buffer and 200 uM dNTPs, 
but no enzyme. 

Distribute 100 ul into tube one, and 50 ul into the rest of 8-10 tubes. Add 1 
ul of final pool of KlenTaq to tube one and mix. Then remove 50 ul to tube two and mix 
that, and so on down the series, which will then contain decreasing amounts of enzyme in 
two-fold steps. Cover each 50 ul reaction with a drop of mineral oil, spin, and PCR 16 
cycles at T 95° C, V 65° C.^S* 72° C. 
Final Bench-Strength KlenTaq-278 Enzyme 

Using 63% glycerol / KTA (.5% Thesit) buffer with 222 mM ammonium 
sulfate, dilute the pool conservatively so that 1/4 ul should easily catalyze the amplification 
the 2 kb span by PCR. Do not decrease the ammonium sulfate concentration below 50 mM. 
Store at -20° C 



EXAMPLE 4 
DNA Amplification 
As reported in Figure 3, a PCR amplification assay to produce 2 kb of DNA 
product was conducted using Thermus aquations DNA polymerase (AmpliTaq) (prior art 
DNA polymerase) and Klentaq-278. To test polymerase thermostability at elevated 
temperatures, the DNA denaturation step of the PCR amplification reactions were conducted 
for 2 min. at 97o C , 9 8o C and 99o C , respectively, using graduated concentrations of DNA 
20 polymerase. 

Hie amplification procedures used followed approximately the protocol for 
amplifying nucleic acid sequences outlined by Saiki et_aL Science 239:487, 1988. A 1 ml 
reaction mixture was prepared containing 100 ng of plasmid pLC, 200 pmoles each of 
primers US (SEQ ID NO:ll) and 1x3 (SEQ ID NO:12), reaction buffer (20 mM Tris-HCl 
pH 8.55, 16 mM ammonium sulfate, 2.5 mM MgCl 2 and 150 ug/ml BSA), 200 uM dNTPs, 
but no enzyme. 100 ul of the reaction mixture was placed into tubes. Aliquots of AmpliTaq 
and Klentaq-278 were then added and 20 cycles of PCR were undertaken. 

Figure 3 shows the results of the experiment to compare the practical 
thermostability limits. The only change between the 3 panels shown is the temperature of the 
2 min. denaturation step: 97* C, 98* C, or 99* C. A range of enzyme concentrations was 
used in order to be able to detect small effects on the effective PCR catalysis activity. The 
template was 10 ng of pLc (a clone of an R color control cDNA from maize. PNAS 86:7092, 
Science 247:449). The primers were 1x5 (SEQ ID NO: 11) and Lc3 (SEQ ID NO: 12). Other 
details of the reactions are given in the assay section of Example 3. 

It can be seen in this experiment that 98° C was not detectably detrimental 
to KlenTaq-278, yet AT was nearly completely inactivated by this temperature. 
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In the experiment shown in Figure 4. each of four enzymes (AT, KlenTaq- 
278 ST, and KlenTaq-291) was tested for thermostability at 98' C. Each was tested in pairs 
of two concentrations differing by a factor of 2. The volumes of actual enzyme preparation 
are indicated above each lane in ul. The amount used was adjusted from previous titrations 
5 (conducted as described for the 2 kb PCR assay in Example 3 and the legend to Figure 3) so 
that a 2-fold drop-off in activity would be detectable. Note the large amount of ST necessary 
to function at the 95° C control PCR. A previous attempt at this experiment (data not shown) 
used only 1/4 these volumes of ST (which would have been equivalent standard DNA 
polymerase incorporation units compared to KT-291 and KT-278), and no product was 
10 obtained. 

EXAMPLE S 
Wn plf. Colony PCR 
The analysis of single E^coU colonies by PCR is a convenient screen for the 
presence aad/or orientation of desired DNA fragments during a cloning or recloning 
IS procedure. In the prior art. the bacteria may not be simply added to a complete PCR 

reaction, since they evidently do not lyse efficiently enough to release the plasmid DNA mat 
is to be the template for the PCR. Instead, and cumbersome*, since it requires a complete 
extra set of labelled test tubes, bacteria must first be suspended in water, not buffer, in the 
optional but recommended (Riggs etaL) presence of chelating resin, and heated to 100° C for 
20 several (such as 10) minutes. Then 1-10 ul of the heated bacterial suspension is added to an 
otherwise complete PCR reaction, which is then cycled and analyzed normally. 

The improvement here is that, since Klentaq-278 can withstand 98-99° C. 
during the denaturation step of each PCR cycle, the bacteria can be added directly and 
conveniently to a complete (including Klentaq-278 enzyme) PCR reaction and men the PCR 
25 cycling can begin without further pretreatment. The only difference from a normal PCR 
cycling is that the full 98° C (2 min.) or 99' C (1 min.) temperature is used during each 
denaturation step (or at least the first 5-10 steps) of the PCR. The experiment in Figure S 
used 2 min. at 98« C for all 25 cycles, and demonstrate, that this method gives rise to a more 
intense and reliably distinguished product band even man the prior art method which utilizes a 
30 10- 100" C separate treatment. This improvement is not possible with AT enzyme, since AT 
enzyme is inactivated at 98" C (as shown in figures 3 and 4). 

Figure 5 is a photograph of an agarose gel of a demonstration of the 
advantage of a 98' C denaturation step in colony PCR, compared to the standard 95° C 
temperature. L«« 1 and 3 employed the prior art pre-treatment of the bacteria in distilled 
35 water at 100° C for 10 minutes before addition to the PCR reaction. Lanes 2 and 4 

conveniently dispensed with this step and the same amount of bacterial suspension (about 2 to 
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4 X 10« cells, but the identical volume of the same bacterial suspension) was simply 
introduced into the complete PCR reaction (including buffer, triphosphates, primers and 
enzyme KlenTaq-278.) Lanes 1 and 2 employed the standard 95' C, and lanes 3 and 4 

employed the newly possible 98' C denaturation/celWisruption temperature. The cycle 

conditions were 2 min. at 98' C or 95' C. 2 min. at 65' C, and 5' at 72' C, for 25 cycles. 

The primers used were KT2 (37mer GAG CCA TGG CCA ACC TGT GGG GGA GGC TTG 
. AGG GGG A) and KlenTaqS? (see Figure 1). The bacterial cells were X7029 containing 

plasmid P WB319, a broad-host range plasmid containing the coding region of the gene for 

KlenTaq-278. 

Lane 4 is the most convenient and the most effective method, and it takes 
advantage of the new stability of KlenTaq-278. 

EXAMPLE 6 
Efficent and Accurate PTR Amnlifimtinn «t 
Lone DNA Tai^tc- (P.- ^ 

A preferred embodiment of the above formulation (designated KlenTaq-LA): 
Starting with the purified enzymes in storage buffer, mix 1 ul of Pfi, DNA polymerase at 2.5 
u./ul with 64 ul of KlenTaq-278 at 25 u./ul. Store at -20° C. 

Larger amounts of Pfu are detrimental to some PCR amplifications, perform 
equally for some, and are beneficial for some. For testing of the optimum level of Pfu, 
several reactions complete with KlenTaq-278 are aliquoted in the amount left to right of 75 ul, 
25 ul. 25 ul. and as many additional 25 ul aliquots as desired. Then 3/8 ul of Pfu (equivalent 
to 0.5 ul per 100 ul - this is about the most that one would ever want) is added to the 
leftmost, 75 ul reaction and mixed. Serial, two-fold dilutions are then made as 25 ul + 25 ul 
left to right along the row of tubes, adding no Pfu to the last one. as a control of HenTaq- 
278 alone. A reaction of 1/2 or 1 ul (per 100 ul) of Pfu alone should also be run. 

Reaction buffer is Pa as above, supplemented with 200 uM of each dNTP 
and 800 uM of MgCl, (total Mg~ 3.3 mM), and per 100 ul of reaction volume, 20 pmoles 
of each primer MBL (SEQ ID NO:7) and MBR (SEQ ID NO:8), and 30 ng of XplacS intact 
phage. Per 100 ul of reaction volume, 1 or 1/2 ul of KTLA are effective levels of enzyme. 
Suitable PCR cycling conditions are two-temperature: 20 seconds at 94° C, 11 minutes at 
70° C, for 20 cycles. Alternate cycling conditions include two-temperature PGR with 1 
rnmuteat98°CandlOmimitesat65-C. 10 to 16 ul are loaded onto an agarose gel for 
product analysis by staining with ethidium bromide. See Figure 6 for other details and 
variations. The template was XplacS. which carries a portion of the lac operon region of the 
E^coji genome. Thirty ng of phage DNA were included in each 100 ul of reaction volume 
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induced as intact phage particles. The primers are homologous to wild-type lambda DNA 
and amplify X DNA, not the M DNA. Primer MBL No. 8757 (5« nucleotide matches base 
pair 27914 of X DNA) is OCT TAT CTG CTT CTC ATA GAG TCT TGC (SEQ ID NO:7). 
Primer MBR No. 8835 (5' nucleotide matches bp 34570 of X DNA) is ATA ACG ATC ATA 
5 TAC ATG GTT CTC TCC (SEQ ID N0:8). The size of the amplified product is therefore 

predicted to be 6657 bp. 

As shown in Figure 6A and 6B, each DNA polymerase enzyme (KlenTaq- 
278 or Pfu) alone gives rise to a faint product band (except for some reactions, when Pfu 
alone does not work at all), but the combinations all give rise to product bands that are 20 to 
,0 50 times more intense than either enzyme can catalyze on its own. 

Figure 6C, second lane from the right, shows the surprising result of addmg 
as littles 1/64 ul of Pfu to' 1 ul of KlenTaq-278 (a units ratio of 1/640). Not shown are data 
that as little as 1/200 ul (1/2000 in units) of Pfu contributed a noticeable improvement to the 

efficiency of this test amplification. 
1S vent DNA polymerase required 10-fold higher amounts (yet still minority 

amounts) for similar functionality. 

An additional, beneficial, and unexpected attribute to the PCR reactions 
catalyzed by KlenTaq-LA was a phenomenal, never previously observed intensity and 
sharpness to the PCR product bands. In part, this increased yield is manifested by a dark 

20 area in the middle of the bands as photographed. This darker area in the euridium 

flourescence is believed to be due to UV absorbance by the outside portions of the band, 
reducing the potential UV-activated flourescence. The system apparently allowed a much 
greater yield of product then did the prior art, which tended to create a broad smear of 
product, and increasing amounts of side product, when amplification was allowed to proceed 

25 to this extent. 

JEXAMPLE7 

prn^t .-rf Accurate P ™ AT' ifieation of 
t nn p DNA Targ ets: (Part 
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Efficient amplification of 8.4 kb, 12.5 kb, 15 kb, and 18 kb was 
demonstrated by the experiment depicted in Figure 7. This experiment extended the 
demonsttated performance of the a preferred embodiment of the invention. 1/640 KlenTaq- 
LA. even further. The amplification was highly successful for the size range 8.4 to 15 kb, 
detectably successful for 18 kb, but not successful for an attempted 19.7 kb. 

Eigh, different PCR reactions were run in this experiment, differing from 
each other in the template or amount of template or in the primer pair employed, as shown in 
the legend on Figure 7. Each reaction was divided 3 ways and cycled differently in parts A, 
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B.andC. Be^ceopartsAandB.Uiiscxpcrirnen 

denaturation phase. In parts B and C, this experiment compared 94' to 93« far 30 cycles 
mS CXPCrimCnt Utilh£d 13 * 0f (at a Klentao-278/Pfc ratio of 640) per 100 ul 

ofreaction. This may have been a little too much enzyme, since high enzyme has been 
assorted in previous experiments with the catastrophic synthesis of product which cannot 
enter the gel, as occurred here for the reaction products in channels 2B and 6C At the 
, current stage of development of long PGR using the invention, this poor outcome occurs 
about 10% of the time. 

Comparing conditions B and C, it is apparent that somewhat Iow er deration 

temperature is desirable. This is consistent with similar experiments comparing time a, 94' C fa 

which yield of ,ong PCR products was found to be decreased as the denaturation time increased' in the 

and » — -.TC.. fci ^^ rf- ^ ^ 

that there was at least one weak link i e ]«■« t>,.™„..ui 

^' leasl « ta ™**Ne component, in the reactions which is 
subject to inactivation a. 94'. Since 94- is Wow we ^ ^ ^ ^ ^ 

polymerase activity and the DNA. it is believed that i, is no. the thennolabile element In an 
«ve embodiment of this aspect of the invention Pfc DNA poiymerase is replaced as the minority 
^onen, w.th a more thermos,*, 3-.xonuc.ease of a DNA polymerase such as. bu, no, Hmited to 
ma from the Archaebacterium strain ES4, which can grow at tenures up to C fPledger ' 

1 :2 J "" y Gen - Microbio1 - 137 (1991)1> which — «~* 

^a, of the so^ of the Pfc DNA po.ymerase (,«. C, Blumentals, ,,. ^ (1990) M of ^ 
N.Y. Acad. Sci. 589:301-314.) 

. ^^^^-^^^^ofmeMkbbandn.tchedinon.y 

— -*» amounts. This was a measure of the imptoved efficiency provided by the 

•nvenUon. and the fcnher result was that the yield cata.yzed by the invention in 30 cycles gly 
exceeded the yield reported by these autho* for 30 cycles. Accurate mutation has no, yet been 
«H- out to r^sure the efficiency of the two methods. b„, inspection of Figure 7 compa^d to the 
figure pubhshed by Kaitue e^ shows a yield for the ,5 kb fragment that is estimate, to be some 
100 umes htgher. This corresponds approximately to . doubled efficiency of PCR extension. 

EXAMPIJ R 

Efficcnt and A™ m ,. PCR Amp lifi-,,;,., ..«• 
Lone DNA Tury.^- fP.„ q 

Materials an d MethnHc 

DNA Polymerases. DNA polymerases Ven, a.d Deep Ven, were supplied by New 
England Biolabs. Pfu DNA polymerase and its exo" mutan, were supplied by Struugene a, 2.5 
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4>W5b " AM agarose 6*1* "em ron 01 fa^ed in ediidium bromide at 0.5 og'ml and 

hed (35 mm ASA WO bladt and white «..) - ***** <*¥» * " S ™' 81 " 

10 were exposed 50% less than the right halves. 

DNA primers are listed in Table 1 and in the Sequence UsUng. 
,JL DNA opiates. XvacA, a gift from S. Pbadnis. is a XEMBW-vectored^ 

Cone of the cytotoxin gene region of H.U.oha^ori DNA. This DNA was exited an stored 
clone ot toe cyioiowu , XKl3g ^ were added as intact phage 

^ and Accurate PGR. PC2 Reaction buffer (10) consisted ofOTmM Tris-HCl 
„ 8 55 at 25' ^g/ml BSA. 16 mM (NH^O, 3.5 mM KA *° uM each dNTP. For 

1 28 kb 1 35 lb) 1.5 ul of 2 M Tris base was added to each reaction, correspond** to 
success above 28 kb (at 3» kdj. i -» .,«»r Contact with a pH 

EH 9 1 measured for toe Tris-HQ component only a. 20 mM u, water a. 25 C. Contact P 
pH 9.1 measure* so dH was omy measured on separate aliquots. and found to be 

probe was detrimental to ^^j^™^^ 100 ulof reaction volume contained 20 pinoles of each 
g^mthefmalre^onatK C.EachlOOu ^ was for 

primer, and 0., to !0 ng of ph^e DNA template 0.8 or L2 of* ^ ^ ^ 

under 20 kb and over 20 kb. respectively. Reactton volumes per tube were 33 5 
^ oil in thin-walled (PGC or S^gene) plasuc test tube, ^ $ 

PCR reactions utilizing the primers at the ends of X required 
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min. at 68"-72° to disrupt the phage particles and to allow fill-in of the X sticky aids to complete the 
primer homology. Optimal cycling conditions were in a multiple-block instrument (Robo Cycler, 
Stratagene) programmed per cycle to 30 sec. »•. 30 sec. 67°, and 11 to 24 min. at 68°, depending 
on target length over the range shown in Table 1. The second-best cycler was the Omnigene 
(HybAid), programmed under tube control per cycle to 2 sec. at 95% then 68 e for similar anneal- 
ing/extension times. Unless otherwise stated, all of the experiments reported here used 24 cycles. 

For reported .results of comparison of conditions such as cycling temperatures and 
times, thermal cycler machines, thick and thin-walled tubes, etc., reactions were made up as 100 id 
complete and then split into identical aliquots of 33 ul before subjecting to PCR cycling. 
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Table 1 . Primer and template combinations. 
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Product Left 
Size Primer 
SEQID 



Right 
Primer 
SEQID 



5.8 

6657 

8386 

8.7 

12.1 

12.5 

15560 

18.0 

19.8 

20707 

19584 

13971 

22.0 

24.6 

22495 

26194 



25 MBL101 28 
7 MBL 8 
9 MBL-1.7 8 

26 MBR001 29 
22 lacZ333 27 
7 MBL 27mer 8 

25 or MBL101 33mer 27 

10 MSA19 28mer 27 

28 or MSA1933 33mer 

25 MBLI01 

20 XL36 

25 MBL101 
20 XL36 

26 MBR001 33mer 
20 XL36 



20 XL36 

20 XL36 

21 lacZ'533 
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28083 20 XL36 
34968 20 XL36 



27 
24 
29 
22 
29 
21 
28 
23 
29 

24 
27 



MSA1933 

MBR 

MBR 

XR36 

MBR202 

MBR 27mer or 

MBR202 33mer 

MBR202 

MBR202 

MBL002 

XR36 36mer 

lacZ333 

XR36 

lacZ'533 

MSA1933 

lacZ536 

XR36 

MBL002 
MBR202 



Template 
DNA 



XK138 
Xplac5 
Xplac5 
XK138 
XK138 
XvacAI 

XplacS 

XK138 
XK138 
Xplac5 
XplacS 
XplacS 
XKI38 
XK138 
Xplac5 
Xplac5 

Xplac5 
XplacS 
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Legend to Table 1. 

Product sizes in integer base pairs are as predicted from the sequence and structure of X and XplacS as 
documented in Genbank accession no. J02459 and ref. (21). Product sizes with decimal points in kb 
were determined by comparison with these products and with the X+Hindm size standards labelled 
XH3. The sequence of the primers is given in the Sequence listing. 

Megaprimer consisted of gel-purified 384 bp PCR product DNA homologous to the 
region between the BamHl site and EcoRI site of the gene coding for the CryV ICP of Bacillus 
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(14), and primer-modified to remove these restriction sites. The PCR rations in Figure 
10 each employed megaprimer (300 ng) , primer BtV5 and 20 ng of genomic DNA from Bjscillus 
jhunaa^sis strain NRD12 (15), and enzyme as indicated in the descripiton above of F.gure 9. 
Cycling conditions were 30 sec. 9S\ 7 min. 60°. for 20 cycles. 

HindUl digestion. Unfractionated. total PCR reactions for 28 and 35 kb targets 
were supplemented with 1/10 volume of lOXNaTMS (IX = 50 mM NaCl, 10 mM Tris-HCl P H 7.7. 
10 mM Mga„ 10 mM mercnptoethanol) and 2 ul (10 units) of restriction enzyme Hindm, and 

incubated at 55° C. for 90 min. 

Test of exo- Pfu. Each 100 ul of reaction (incubated as 33 ul under 40 ul of o,l) 
contained 2 ng IpbeS DNA as purified phage particles. 20 pmoles each of primers MBL-1.7 and 
MBR reaction buffer PC2 and 1 ul of Klentaq-278 (0.7 ug). except for reaction 6. which contained . 
ul Pfu DNA polymerase (2.5 u.) alone. Other deuils are in the description of Fig. 12. Thermal 
conditions were 24 cycles of 2 sec.at 94M 1 min. at 70«. 

The discovery leading to the DNA polymerase mixture of the present invent.on was 
made during attempts to utilize in PCR a primer with a mismatched A-A base-pair a, its 3' end. In 
fact the primer was itself a PCR product "megaprimer' of 384 bp. and the mismatched A had been 
added by Klentaq-278 using non-templated terminal transferase activity common to DNA polymerases 
(16) Neither Klentaq-278 (Figure 9, lane 3) nor Pfu DNA polymerase (Figure 9, lanes 1 & 2 and 
other levels of enzyme not shown) could catalyze amplification of the 1500 bp target that lay between 
the PCR-product megaprimer and a 42mer oligonucleotide primer. The combination of the two 
enzymes however, was well able to catalyze amplification of the desired target fragment (F,g 9. lane 
4) Evidently, the Pfu DNA polymerase removed the presumed 3' A-A mismatch, allowing Klentaq- 
278 catalysis to proceed efficiently for each step of the PCR. The same result was obtained w,,h Ven, 
DNA polymerase substituted for Pfu (data not shown). 

1 hypothesized that mismatched 3'-ends are a general cause of inefficient primer 
extension during PCR of targets larger than a few kb. As a test system I employed a 6.6 kb lambda 
DNA target which was amplified detectably but poorly by AmpliTaq. Klentaq-278 or Pfu DNA 
polymerase in a variety of standard conditions. Per 100 ul reaction volume, 1 ul of Klentaq-278 was 
combined with various amounts of Pfu DNA polymerase, from 1/2 ul down to as little as 1/200 u. of 
Pfu Since the Pfu stock (2.5 units/ul) was at least 10 times less concentrated man the Klentaq-278 
stock (25-30 units/ul), the actual rUios tested were 1/20 to 1/2000 in DNA polymerase unite. 
Representative results of these tests are shown in Rgure 6B. A high yield of target band was 
observed for all tested combinations of the two enzymes, yet several levels of each enzyme on its own 
failed to catalyze more than faintly detectable amplification. The lowest level of Pf- tested. 1/200 ul. 
35 exhibitedonlyasUghtbeneficialeffect. Theapparent broad optimum ratio of Klentaq^Pfu! was 
16 or 64 by volume, which is about 160 or 640 on the basis of DNA polymerase incorporate unUs. 
When tested a, 6-8 kb (date no. shown), other combinations of 3'-exo" and 3W thermostable DNA 
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polymer also showed the effect, including Amplittq/Pfa, Klentaq-278/Vent, KlentaqS (DeluTaq 
USB) / Pfu, Stoffel Fragment/Pfu. KlenUq-278/Deep Vent (our second choice), and Pfu exo- / Pfu ' 
exo + . Although cumulatively few trials and optimizations were carried out, no other combination 
tried was as effective as Klentaq-278/Pfu. 

A very short heat step is preferred. I next attempted to amplify DNA in the size 
range 8.4 to 18 kb from lambda transducing phage template. Our early cycling protocol employed a 
denaturatmn step of 1 or 2 minutes at 95» or 98« C, but no useful product in excess of 8.4 kb was 
obtained until the parameters of this heat step were reduced to 2 sec. or 20 sec. at 93' or 94° C In 
an experiment with the denaturation step at 94° for 20, 60, or 180 sec. the 8.4 kb product exhibited 
decreasing yield with increased length of this heat step (data no, shown). Apparent*, a component of 
the react,on « a, its margin of mermostability. Figure 7 shows that, using the short 2 sec. 
denaturation step, target fragment was obtained for some reactions a. all sizes in the range 8.4 to 18 
kb, with very high product yields up to 15 kb if 30 PCR cycles were employed. Rgure 7 also shows 
some failed reactions which I cannot explain. The failure mode mat gives rise to massive ethidium 
sunning in the sample well (30-cyc.e lane 2) was particularly common, especially a. high enzyme 



levels. 



Longer Primers. A change in primer length from 27 to 33 greatly reduced the 
frequency of failed reactions. Rgure 10 demonstates improved reliability f„ r amplification of 12 5 
15and 18 leb with the longer 33mer primers, under conditions of otherwise optima.lv high enzyme ' 
20 levels in which the 27mer primers fai.ed to give rise to desirable targe, product. This result does no, 
represent an extensive survey of primer length, and i, has no, ye, been repeated with the improvements 
below. Therefore the optimum primer length for long PCR remains to be determined. Some of the 
amplifications analyzed in Figure 11 utilized 36mer primers from the very ends of X A 2-5 min 
preincubation a, 68-72" (22) was necessary to release the template DNA from the phage particles and 
to fill in the sticky ends of lambda to complete the template homology with primers XL36 and XR36. 

Rapid cycling. A change to thin-walled tubes, which have lower heat capacity and 
conduct heat more efficiently, further improved the reactions. Figure 1 1 shows a CHEF pulse-field 
agarose gel analysis of successful amplifications of DNA spans 6-26 kb in size. The target of 28 kb 
was no. amplifiable in the Omnigene thermal cycler (data not shown), but did appear (Figure 12. lane 
30 2) when the RoboCycler was employed. 

Several models of thermal cycler have been employed, and although no, all have 
been optimized, some are preferable to others for long PCR. As may be concluded from the 
advantage of thin-walled tubes noted above, success seems to be positively correlated with a high 
speed of temperature change made possible by the design of the thermal cycler. The RoboCycler 
achieves rapid temperature change by moving tubes from block to block, and observations with a 
thermistor temperature probe indicate mat i, raises the reactions to 93-95" for only 5 sec. under the 
denaturation conditions employed (30 sec. in the 99° block), before rapidly (within 30 sec) returning 
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the reaction to 68° 



20 



Higher pH. THe current record 35 kb (Figure 12, lane 3) was only amplifiable if 

a A nrftliminarv scan of higher pH was carried out (data not shown), and this 
the pH was increased. A preliminary scan oi v 

rt f a, « kb band at pH 8.8 to 9.2, with the optimum at 9. 1 as described in 
resulted in the appearance of the 35 kbbanoaipn 

Method ^ t 

1 any ^ ^ *- e^ges in buffer cendmons ^ ^ose opt^ for 8.4 k , 
For Targe, over 20 k bs extension times exceeding TO mm. are preferred and the extend 
temperature is preferably below 69° C. . . 

Identity of long PCR products. It can be seen in Figures 7, 10 and 1 1 that the 

n .anes 6 and 7) resulted in me expected left arm of lambda (23 *) and 2.3 » band ta* and 
I" plicubie bands terminated by me right PCR pHtne, 447 bp (bare,y visible) from me 28 Kb 

nroduct and 733 1 bp from the 35 kb product. 

Exonudease mutant Ue available mutant of Pfu DNA polymerase (8) winch , 
native in the «,e< se activity was tested. Figure 13 shows mat the mutant of Pfu 

DNA p 0 l y m e ra S efai 1 s,op ro moteeff.cient M npU fi cati 0 nofalon g D N A^e, T.S supports our 

hLL s mat the "tWity - **— ** " "* amPl ' fiCaU ° n " ^ 



25 



size range 



Rdelity test. Since the biological purpose of ^nuclease is to edit base pair 
etches for high region Hdelity , we te*ed me fidelity of the PCR prod« an ^ 
invoivmg the amplification and molecular cloning of an entire lacZ (p^actos.dase gene flanked by 
losellblemLcersdO). Heretofore *^^^ m -^"~ 
640 - mixl of Klentao-278 and Pfu DNA polymerase at least matches mat obtamed for Pfu 
as Klentaq-LA OQenTaq Long and Accurate) reflects this high fidelity performance 
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Table 2. Non-silent mutations introduced into the lacZ gene by 16 cycles of PGR 



(10). 



40 



UcZ+ I*cZ- % Effective Errors Fold Improve- 

Enzyme Blue light Blue mutant cycle no. per 10 s ment over 

( c > t»P(b) full-length Taq 



or White 



KTLA-64 


571 


34 


/ 5.6 


12 


1.05 


12.7 


Pfu 


528 


37 


6.5 


8 


1.9 


6.9 


Klentaq5* 


442 


85 


16.1 


8 


5.1 


2.6 


Klentaq] 


3225 


985 


26.4 


8 


9.0 


1.5 


Amplitaq 


525 


301 


36.4 


8 


13.4 


1.0 



I rTf 11 L eti0 " ° f ^ DNA P ° lyn,erase 

xn ref . 10 . (b) Equation 1 of reference 10 was rearranged to be as 

follows to solve for errors per bp: x - - ( l„ (2F -„ . * , 

where x xs the errors per bp incorporated, 1000 is the effective' 

-roet 61Z e xn the lacZ gene U0, . P is the fraction of blue colonies 
and m 1E the effective ^ ^ ^ ^ onxes 

^ci e nun ^ er was eBtimated at le6s than the ^ ^ c 'Ji;iiiii™ 

the actual efficiency of the reaction, yet higher than the mini™ 

ZZ T ^ f0ld ^ lifi -ion. strand loss due to in om^ 

synthe B16 of product etran(jE _ s a ^ ^ »P te 

a^lxfxcatxon efficiency. Therefore successful (not lost) produc 
molecules are 3 udged to have undergone more than tta calculated 
m-xmum number of replications. KTLA-64 (Klentaq-278 :Pfu: :64 •! by 
volume) was assigned a higher effective cycle number since its 

ZToTl 6tar T 10 timeS 1688 (1 - 5 n9 V6 - " «* 

PHB305) to result in comparable levels of product. 

DISCUSSION 

The previous length limitation for PGR amplification is 
post ult ed to have been caused by low efficiency of extension at t e 
sxtes of xncorporatxon of mismatched base pairs. Although it would 
have seemed that the cure for these mismatches would be to employ 

TeTZT^ 3 '- (editin9) — ■ I ^lieve that when Pfuld 
Vent ^ polymerase are used to catalyze our amplifications on their 
own, thexr faxlure xs due to degradation of the PCR primers by their 
3 --nucleases, especially during the required long synthesis txa.es 
and at optxmally high DNA polymerase levels. Evidently, low levels of 
3 -exonuclease are sufficient and optimal for removal of the mismatches 
to allow the Klenta q - 278 and amplification to proceed, it has ZT 
demonstrated that the optimally low level of 3- -exonuclease can be set 
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■ and flexibly by mixing and dilution. 

Actively. = ^ ^ J ^ ^ ^ high . „ equal 

le vels of the two types of — are used (or ^re .the « component 
is in excess), or if the ratio of exo-/exo+ is 4 or less, the 
effectiveness of the long PCR. even under optimal cycling conditions 
discussed below, is non-existent or much, reduced. 

It is preferred, and for certain applications, important 
that the length and pmperature of the heat denaturation step of the 
PCR be Kept to a minimum. Further, the improvement obtained by 
PCR be Kep Bliqhtly nay correspond to a decrease in template 

increasing the pH slightly y result . f depurination 

« " I^ority^ polymerase component can be found 

,5 less than 20 sec, and most preferably, 5 sec. or less in the reaction 
tself Z 950, is surprisingly effective for the amplification of 3 
rlreal * might have been expected - longer PCR targets woul d 



„T r aenaturat ion time to become completely denatured 
need longer denaturation requires 

complete ^^^r^Sr^Xl til - dually 
" =.r^L"« - 5 sreonds. .his could limit the si, of 
^lifiable product because of the increased depurination caused by 

l0 „ger ^^^10-. were successful with several different 
2S target sequences, with several primer combinations, and 

sizes up To nearly twice the maximum size of inserts cloned into X. 
Lie vLses and plasmids up to 35 Kb in length should now be 
with this syste Shou, J-J. 

30 ^r^^s^. — — - iif rpr t f: E iis 

3enlent and avoids the DKA rearrangement and gene toxicity pitfalls 
of in^e cloning ^ ^ ^ ^ ^ ^ _ ^ ^ 
objects of the invention are achieved and other advantageous results 
35 attained. ^ ^ ^ ^ ^ ^ M methods ^ 

products without departing from the scope of the invention it is 
intended that all matter contained in the above description shall 
interpreted as illustrative and not in a limiting sense. 
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Table of Contents of Sequence Listing 

SEQ ID N0:1: PCR primer KT1. 

SEQ ID N0:2: N- terminus of Klentaq-278. 

lln II S'-V PCR primer MBL on left side of lambda. EMBL4 inserts. 
til £ NO* 8 • PCR primer MBR on right side of lambda EMBL4 inserts 
10 lit S SiS PCR primer MBL-1.7, 1729 bp to left bf MBL. 
iEQ ID NO:10: PCR primer MSA19, at C-terminus of lac*. 
SEQ ID NO: 11: PCR primer Lc5 
SEQ ID NO: 12: PCR primer Lc3 
SEQ ID NO: 13: PCR primer KT2 
15 SEQ ID NO: 14: Taq DNA polymerase gene, KT1 ena 

SEQ ID NO: 15: Taq DNA polymerase gene, 3 end 

SEQ ID NO: 16: Tfl DNA polymerase gene, KT1 ena 

SEQ ID NO: 17: Tfl DNA polymerase gene, 3 ena 

SEQ ID NO:18: PCR primer BtV3 
20 SEQ ID NO: 19: PCR primer BtVS 

SEQ ID NO:20: PCR primer XL36 

SEQ ID NO: 21: PCR primer lacZ'533 

SEQ ID NO: 22: PCR primer lacZ333 

SEQ ID NO:23: PCR primer lacZ536 
25 SEQ ID NO: 24: PCR primer MBL002 

SEQ ID NO: 25: PCR primer MBL101 

SEQ ID NO: 26: PCR primer MBR001 

SEQ ID NO: 27: PCR primer MBR202 

SEQ ID N0:28: PCR primer MSA1933 
30 SEQ ID NO:29: PCR primer XR36 

SEQ ID NO: 30: Phi 29 fragment 1, Region III 

SEQ ID NO -.31: Pfu fragment 1, Region III 

SEQ ID NO: 32: Phi 29 fragment 2, Region I 

SEQ ID NO: 33: Pfu fragement 2, Region I 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Barnes Ph.D., Wayne M 

(ii) TITLE OF INVENTION: DNA polymerases with 

enhanced thermostability and enhanced length and 
efficiency of primer extension 

'.'.ii) NUMBER OF SEQUENCER: 33 

(iv) CORRESPONDENCE ADDRESS : 

(A) ADDRESSEE: Senniger, Powers, Leavitt & Roedel 

(B) STREET: One Metropolitan Square 

(C) CITY: St. Louis 

(D) STATE: Missouri 

(E) COUNTRY: USA 

(F) ZIP: 63102 

(v) COMPUTER READABLE FORM: A 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

<C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.25 

<vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 22-FEB-1994 

(C) CLASSIFICATION: 

{viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Blosser, G.Harley 

(B) REGISTRATION NUMBER: 33,650 

(C) REFERENCE /DOCKET NUMBER: WNB4903 

(ix) TELECOMMUNICATION INFORMATION: 
<A) TELEPHONE: (314) 231-5400 

(B) TELEFAX: (314) 231-4342 

(C) TELEX: 6502697583 MCI 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: N- terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Thermus aquaticus 

(B) STRAIN: YT1 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: synthetic 

(B) CLONE: KT1 
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( ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 6, .35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GAGCC ATG 'GGC CTC CTC CAC GAG TTC GGC CTT CTG G 
Met Gly Leu Leu His Glu Phe Gly Leu Leu 

/ 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Gly Leu Leu His Glu Phe Gly Leu Leu 



(2) INFORMATION FOR SEQ ID NO: 3: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: YES 

(v) FRAGMENT TYPE: C- terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Thermus aquaticus 

(B) STRAIN: YT1 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: synthetic 

(B) CLONE: Klentaq32 

( ix) FEATURE : 

(A) NAME /KEY: CDS 

(B) LOCATION: complement (8.. 34 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 3: 
GCGAAGCTTA CTACTCCTTG GCGGAGAGCC AGTCC 



(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Asp Trp Leu Ser Ala Lys Glu * * 
1 5 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6714 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Expression vector 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: pWB254b 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1665 



48 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO- 5- 

55 60 

75 80 

asajsaaagaga-gggj;. 

105 110 

22255SB522SS522S "> 



120 125 



528 



576 



624 



672 
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**r (TG TGG GGG AGG CTT GAG GGG 432 

gssssssssa&ji — « -» 

130 135 

— ffsssssaasssss 1,0 

Glu Glu Arg Leu Leu Trp Leu ±yr igs i 6 0 

145 1 

s s s s s s a s s s? s s =sss 

y 180 

„- rrr rr-rr n^c CTC AAC TCC 
GAG GCC GAG GTC TTC CGC CTG GCC GGC CAC CCC TTC ^ ^ ^ 
Glu Ala Glu val Phe Arg Leu Ala Gly n 205 
195 

SSE2 sr. s s s s s a a s s s s 

210 

ATC GGC AAG ACG GAG AAG ACC GGC AAG CGC TCC ACC AGC GCC GCC £C 

He Gly Lys Thr Glu Lys Thr eiy ^y » 235 2 4 0 

225 

s a s = s s s sssa s 2 s 2 a 

245 

5S22SS2K 2 S 55 S 5 S 2 2 

y 260 24,3 

GAC CXC ATC CAC CCC AGG ACG GGC « «C CAC ACC CGC TTC AAC CAG 

Asp Leu He His Pro Arg Thr Gly ^rg ^ 

SS S S S S 5 i 2 = 2 2 2 2 2 2 2 
SS2S2S22S22J22SS2 

305 

SS22SS2222=5" S S S 

325 

„„„ TCC ggc GAC GAG AAC CTG ATC CGG 

GAG CTC AGG GTG CTG GCC CAC CTC TCC GGC GAC ^ ^ ne ^ 

Glu Leu Arg Val Leu Ala His Leu be 350 

340 J * 
GTC TTC CAG GAG GGG CGG GAC ATC CAC ACG GAG ACC GCC AGC TGG ATG 
Val Phe Gin Glu Gly Arg Asp |le m 365 

355 3feU ' 

TTC GGC GTC CCC CGG GAG GCC GTG GAC CCC « ATG CSC CGG GCG GCC 
Phe Gly Val Pro Arg Glu Ala val ^ 38Q 
370 

SS5S2SSSSSSSSSSS 



816 



864 



912 



1056 



1104 



1152 



385 390 
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25 430 

= a a a? aasasaasaa a - 

^ * U 445 y 

aaaaasaaas a £ 5 s s s 

460 

ssaaasasKf sasass 
aaaaaaaasaaaacaa 

aob 510 

ssgaasasasaasaas 

ATG GAG GGG GTG TAT rrn r^r> ~~ 

-s-"-agasaaaaa S «j5. 

s a a 25 s s s s s a 

550 555 

mum * raccni ATCftGCGGGC ™— 

™ C TCGTGATACG OCTA^A taggtoatc 
TAGCGTCAAA Ca.CC™ TACGCGCCCT ^ 

_G C_ GCTACACTTG e_ _ ^ 
™ C C„ ACGTTCGCCG GCTTTCCCCG TCAAGC^ 
TCCCTTTAGG GncCGA^ 

= ™ ™ — ™ ™ 

cm^ gg*^ tccaaacttg aacaacactc aacccta^t 

««™ TTTTGATTTA TAAGGGATTT 

M«m, ACAAAAATTT AACCCCAA^ ^ACAAAA* attAACGTTT ACAA^ 
O^GCACTT T^GOGAAA, GTGCGCGGAA CCCCTAT™ ITTATTTTTC TAAATACATT 
CAAATATGTA TCCGC^ AGACAA.AAC CCTGATAAAT GCTTCAATAA TATTGAAAAA 
OGAACACTAT GAGTA1TCAA TO ^ 
GCCTTCCTGT CCAGAAACGC 



1248 

1296 

1344 

1392 

1440 

1488 

1536 

1584 

1632 

1682 

1742 

1802 

1862 

1922 

1982 

2042 

2102 

2162 

2222 

2282 

2342 

2402 

2462 

2522 
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TGGGTGCACG 
TTCGCCCCGA 
TATTATCCCG 
ATGACTTGGT 
GAGAATTATG 
CAACGATCGG 
CTCGCCTTGA 
CCACGATGCC 
CTCTAGCTTC 
TTCTGCGCTC 
GTGGGTCTCG 
TTATCTACAC 
TAGGTGCCTC 
AGATTGATTT 
ATCTCATGAC 
AAAAGATCAA 
CAAAAAAACC 
TTCCGAAGGT 
CGTAGTTAGG 
TCCTGTTACC 
GACGATAGTT 
CCAGCTTGGA 
GCGCCACGCT 
CAGGAGAGCG 
GGTTTCGCCA 
TATGGAAAAA 
CTCACATGTT 
AGTGAGCTGA 
AAGCGGAAGA 
GCATATGGTG 
TCCGCTATCG 
CGCGCCCTGA 
CGGGAGCTGC 
CATCAAAAAT 



AGTGGGTTAC 
AGAACG1TIT 
TGTTGACGCC 
TGAGTACTCA 
CAGTGCTGCC 
AGGACCGAAG 
TCGTTGGGAA 
TGCAGCAATG 
CCGGCAACAA 
GGCCCTTCCG 
CGGTATCATT 
GACGGGGAGT 
ACTGATTAAG 
AAAACTTCAT 
CAAAATCCCT 
AGGATCTTCT 
ACCGCTACCA 
AACTGGCTTC 
CCACCACTTC 
AGTGGCTGCT 
ACCGGATAAG 
GCGAACGACC 
TCCCGAAGGG 
CACGAGGGAG 
CCTCTGACTT 
CGCCAGCAAC 
CTTTCCTGCG 
TACCGCTCGC 
GCGCCTGATG 
CACTCTCAGT 
CTACGTGACT 
CGGGCTTGTC 
ATGTGTCAGA 
AATTCGCGTC 
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ATCGAACTGG ATCTCAACAG 
CCAATGATGA GCACTITTAA 
GGGCAAGAGC AACTCGGTCG 
CCAGTCACAG AAAAGCATCT 
ATAACCATGA GTGATAACAC 



GAGCTAACCG CriTl'l 1 TGCA 
CCGGAGCTGA ATGAAGCCAT 
GCAACAACGT TGCGCAAACT 
TTAATAGACT GGATGGAGGC 
GCTGGCTGGT TTATTGCTGA 
GCAGCACTGG GGCCAGATGG 
CAGGCAACTA TGGATGAACG 
CATTGGTAAC TGTCAGACCA 
TTTTAATTTA AAAGGATCTA 
TAACGTGAGT TTTCGTTCCA 
TGAGATCCTT TTTTTCTGCG 
GCGGTGGTTT GTTTGCCGGA 
AGCAGAGCGC AGATACCAAA 
AAGAACTCTG TAGCACCGCC 
GCCAGTGGCG ATAAGTCGTG 
GCGCAGCGGT CGGGCTGAAC 
TACACCGAAC TGAGATACCT 
AGAAAGGCGG ACAGGTATCC 
CTTCCAGGGG GAAACGCCTG 
GAGCGTCGAT TTTTGTGATG 

gcggcctitt tacggttcct 

TTATCCCCTG ATTCTGTGGA 
CGCAGCCGAA CGACCGAGCG 
CGGTATTTTC TCCTTACGCA 
ACAATCTGCT.CTGATGCCGC 
GGGTCATGGC TGCGCCCCGA 
TGCTCCCGGC ATCCGCTTAC 
GGTTTTCACC GTCATCACCG 
TGGCCTTCCT GTAGCCAGCT 



CGGTAAGATC 
AGTTCTGCTA 
CCGCATACAC 
TACGGATGGC 
TGCGGCCAAC 
CAACATGGGG 
ACCAAACGAC 
ATTAACTGGC 
GGATAAAGTT 
TAAATCTGGA 
TAAGCCCTCC 
AAATAGACAG 
AGTTTACTCA 
GGTGAAGATC 
CTGAGCGTCA 
CGTAATCTGC 
TCAAGAGCTA 
TACTGTCCTT 
TACATACCTC 
TCTTACCGGG 
GGGGGGTTCG 
ACAGCGTGAG 
GGTAAGCGGC 
GTATCTTTAT 
CTCGTCAGGG 
GGCCTTTTGC 
TAACCGTATT 
CAGCGAGTCA 
TCTGTGCGGT 
ATAGTTAAGC 
CACCCGCCAA 
AGACAAGCTG 
AAACGCGCGA 
TTCATCAACA 



CTTGAGAGTT 
TGTGGCGCGG 
TATTCTCAGA 
ATGACAGTAA 
TTACTTCTGA 
GATCATGTAA 
GAGCGTGACA 
GAACTACTTA 
GCAGGACCAC 
GCCGGTGAGC 
CGTATCGTAG 
ATCGCTGAGA 
TATATACTTT 
CTTTTTGATA 
GACCCCGTAG 
TGCTTGCAAA 
CCAACTCTTT 
CTAGTGTAGC 
GCTCTGCTAA 
TTGGACTCAA 
TGCACACAGC 
CTATGAGAAA 
AGGGTCGGAA 
AGTCCTGTCG 
GGGCGGAGCC 
TGGCCTTTTG 
ACCGCCTTTG 
GTGAGCGAGG 
ATTTCACACC 
CAGTATACAC 
CACCCGCTGA 
TGACCGTCTC 
GGCAGAACGC 
TTAAATGTGA 
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2762 
2822 
2882 
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3242 
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3362 
3422 
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3602 
3662 
3722 
3782 
3842 
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4022 
4082 
4142 
4202 
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GCGAGTAACA 
AGGTTACGTT 
CGACAGTATC 
CCGGAAACCA 
CGGTGCGGGC 
TAAGTTGGGT 
CCGTAATCAT 
AACATACGAG 
ACATTAATTG 
CATTAATGAA 
TTTTCTTTTC 
GAGTTGCAGC 
GGTTGACGGC 
ATCCGCACCA 
ATCGTTGGCA 
TTGAAAACCG 
GCGAGTGAGA 
GCCCGCTAAC 
CGTACCGTCT 
AAATAACGCC 
CGGATAGTTA 
ACAGGCTTCG 
GGCGCGAGAT 
GGCAACGCCA 
GTAATTCAGC 
GGCCTGGTTC 
GTATAACGTT 
TGCCATACCG 
TCATAGCTGT 
GGAAGCATAA 
TTGCGCTCAC 
GGAGCTTACT 
TGTGAGCGGA 
CTGTTGACAA 



ACCCGTCGGA 
GGTGTAGATG 
GGCCTCAGGA 
GGCAAAGCGC 
CTCTTCGCTA 
AACGCCAGGG 
GGTCATAGCT 
CCGGAAGCAT 
CGTTGCGCTC 
TCGGCCAACG 
ACCAGTGAGA 
AAGCGGTCCA 
GGGATATAAC 
ACGCGCAGCC 
ACCAGCATCG 
GACATGGCAC 
TATTTATGCC 
AGCGCGATTT 
TCATGGGAGA 
GGAACATTAG 
ATGATCAGCC 
ACGCCGCTTC 
TTAATCGCCG 
ATCAGCAACG 
TCCGCCATCG 
ACCACGCGGG 
ACTGGTTTCA 
CGAAAGGTTT 
TTCCTGTGTG 
AGTGTAAAGC 
TGCCCGCTTT 
CCCCATCCCC 
TAACAATTTC 
TTAATCATCG 
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TTCTCCGTGG GAACAAACGG 
GGCGCATOGT AACCGTGCAT 
AGATCGCACT CCAGCCAGCT 
CATTCGCCAT TCAGGCTGCG 
TTACGCCAGC TGGCGAAAGG 



TTTTCCCAGT CACGACGTTG 



GTTTCCTGTG TGAAATTGTT 
AAAGTGTAAA GCCTGGGGTG 
ACTGCCCGCT TTCCAGTCGG 
CGCGGGGAGA GGCGGTTTGC 
CGGGCAACAG CTGATTGCCC 
CGCTGGTTTG CCCCAGCAGG 
ATGAGCTGTC TTCGGTATCG 
CGGACTCGGT AATGGCGCGC 
CAGTGGGAAC GATGCCCTCA 
TCCAGTCGCC TTCCCGTTCC 
AGCCAGCCAG ACGCAGACGC 
GCTGGTGACC CAATGCGACC 
AAATAATACT GTTGATGGGT 
TGCAGGCAGC TTCCACAGCA 
CACTGACGCG TTGCGCGAGA 
GTTCTACCAT CGACACCACC 
CGACAATTTG CGACGGCGCG 
ACTGTTTGCC CGCCAGTTGT 
CCGCTTCCAC TTTTTCCCGC 
AAACGGTCTG ATAAGAGACA 
CATTCACCAC CCTGAATTGA 
TGCGCCATTC GATGGTGTCC 
AAATTGTTAT CCGCTCACAA 
CTGGGGTGCC 'TAATGAGTGA 
CCAGTCGGGA AACCTGTCGT 
CTGTTGACAA TTAATCATCG 
ACACAGGAAA CAGGATCGAT 
GCTCGTATAA TGTGTGGAAT 



CGGATTGACC 
CTGCCAGTTT 
TTCCGGCACC 
CAACTGTTGG 
GGGATGTGCT 
TAAAACGACG 
ATCCGCTCAC 
CCTAATGAGT 
GAAACCTGTC 
GTATTGGGCG 
TTCACCGCCT 
CGAAAATCCT 
TCGTATCCCA 
ATTGCGCCCA 
TTCAGCATTT 
GCTATCGGCT 
GCCGAGACAG 
AGATGCTCCA 
GTCTGGTCAG 
ATGGCATCCT 
AGATTGTGCA 
ACGCTGGCAC 
TGCAGGGCCA 
TGTGCCACGC 
GTTTTCGCAG 
CCGGCATACT 
CTCTCTTCCG 
CAGTGAATCC 
TTCCACACAT 
GCTAACTCAC 
GCCAGCTGCA 
GCTCGTATAA 
CCAGCTTACT 
TGTGAGCGGA 



GTAATGGGAT 
GAGGGGACGA 
GCTTCTGGTG 
GAAGGGCGAT 
GCAAGGCGAT 
GCCAGTGAAT 
AATTCCACAC 
GAGCTAACTC 
GTGCCAGCTG 
CCAGGGTGGT 
GGCCCTGAGA 
GTTTGATGGT 
CTACCGAGAT 
GCGCCATCTG 
GCATGGTTTG 
GAATTTGATT 
AACTTAATGG 
CGCCCAGTCG 
AGACATCAAG 
GGTCATCCAG 
CCGCCGCTTT 
CCAGTTGATC 
GACTGGAGGT 
GGTTGGGAAT 
AAACGTGGCT 
CTGCGACATC 
GGCGCTATCA 
GTAATCATGG 
TATACGAGCC 
ATTAATTGCG 
TTAATGAATC 
TGTGTGGAAT 
CCCCATCCCC 
TAACAATTTC 



4622 

4682 

4742 

4802 

4862 

4922 

4982 

5042 

5102 

5162 

5222 

5282 

5342 

5402 

5462 

5522 

5582 

5642 

5702 

5762 

5822 

5882 

5942 

6002 

6062 

6122 

6182 

6242 

6302 

6362 

6422 
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6542 
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ACACAGGAAA CAGGATCTGG GCCCTTCGAA ATTAATACGA CTCACTATAG «C 
ACGGTTTCCC TCTAGAAATA ATTTTGTITA ACTTTAAGAA GGAGATATAT CC 



(2) 



INFORMATION FOR SEQ ID NO: 6: 

{ i ) - SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 554 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

' (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 
Met flly Leu Leu His Glu Phe Gly Leu Leu Glu Ser Pro Lys Ala Leu 

Olu Glu Ala Pro Tr^ Pro Pro Pro Glu Oly ^ «- Val Gly Phe Val 

20 ^ 
Leu Ser Arg Lys Olu Pro Met Trp Ala Asp Leu Ala Leu Ala ,1a 
35 

, » ,roi Hi n Arc Ala Pro Glu Pro Tyr Lys Ala Leu 
Ala Arg Gly Gly Arg Val His Arg Aia * ^ 

50 55 

if Asp Leu Lys Glu Ala Arg Gly Leu Leu Ala Lys Asp L*u Ser val 

1 Ala Leu Arg Olu Gly I*u Gly Leu Pro Pro Gly Asp Asp Pro Met 



85 90 



Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly Val 

100 105 



6662 
6714 



^ riu Tro Thr Glu Glu Ala Gly Glu Arg Ala 

Ala Arg Arg Tyr Gly Gly Glu Trp Tnr * ^ 

115 1<iU 
Ma Leu ser Glu Arg Leu Phe Ala Asn Leu Trp Gly Arg Leu Glu Gly 

130 " s 
Glu Glu Arg Leu Leu Trp Leu Tyr Arg Glu Val Glu Arg Pro Leu Ser 

145 150 

XU val Leu Ala His Met Glu Ala Thr Gly Val Arg Leu Asp Val Ala 

165 

T^,, riu Val Ala Glu Glu He Ala Arg Leu 
Tyr Leu Arg Ala Leu Ser Leu Glu Val Aia ^ igQ 

180 1B3 
G lu Ala Glu val Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn Ser 

195 2UU 

n *™ val Leu Phe Asp Glu Leu Gly Leu Pro Ala 
Arg Asp Gin Leu Glu Arg Val Leu rne v ^ 

210 215 

ne Gly Lys Thr Glu Lys" Thr Gly Lys Arg Ser Thr Ser Ala Ala Val 

225 23 ^ 

Leu Glu Ala Leu Arg Glu Ala His Pro lie Val Glu Lys He Leu Gin 

245 2SU 
Tyr Arg Glu Leu Thr Lys Leu Lys Ser Thr Tyr He Asp Pro Leu Pro 



260 
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A*P Leu lie His Pro Arg Thr Gly Arg Leu „ i6 Thr Arg P he Asn Gin 

280 28S 
Thr Ala Thr Ala Thr Gly £ ^ Ser Ser Sfir ^ ^ ^ ^ 

^" 300 
Asn lie Pro Val Arg Thr Pro Leu Gly Gin Arg He Arg Arg Ala P he 

315 320 
He Ala Glu Glu Gly Trp Leu Leu Val Ala Leu Asp Tyr Ser Gin He 

/ 330 335 

Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp Glu Asn Leu lie Arg 

345 3S0 * 

Val Phe Gin Glu Gly Arg Asp lie His Thr Glu Thr Ala Ser Trp Met 

360 36S 

Phe Gly Val Pro Arg Glu Ala Val Asp Pro Leu Met Arg Arg Ala Ala 

J/ => 380 

I** Thr lie Asn Phe Gly Val Leu Tyr' Gly Met Ser Ala His Arg Leu 

395 400 
Ser Gin Glu Leu Ala lie Pro Tyr Glu Glu Ala Gin Ala Phe lie Glu 

Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg Ala Trp Il e G1 „ Lys Thr 

L6U tit Hi Val G1 » V* Leu Phe Gly Arg 

440 445 

Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg Val Lys Ser Val Arg Glu 

" s 460 
Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val Gin Gly Thr Ala Ala 

Asp Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro Arg Leu Glu 11 

490 495 
Met Gly Ala Arg Met Leu Leu Gin Val His Asp Glu Leu Val Leu Glu 



505 510 



Ala Pro Lys Glu Arg Ala Glu Ala Val Ala Arg Leu Ala Lys Glu Val 

520 525 

Met Glu Gly Val Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val Gly 

535 540 



lie Gly Glu Asp Trp Leu Ser Ala Lys Glu 



550 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: bacteriophage lambda 
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(B) STRAIN: PaPa 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO:7: 
GCTTATCTGC TTCTCATAGA GTCTTGC 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 
B TYPE: nuclei acid 
(C STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

<vi) 0 ^ I XS: E bacteripha g e lan*da 
(B) STRAIN: PaPa 

(vii) IMMEDIATE SOURCE: 
(B> CLONE: MBR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 
ATAACGATCA TATACATGGT TCTCTCC 
(2) INFORMATION FOR SEQ ID NO:9: 

H) SEQUENCE CHARACTERISTICS: 
U (A) LENGTH: 27 base pairs 
(B) TYPE : nucleic acid 
(C STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO:9: 
TTTTGCTGGG TCAGGTTGTT CTTTAGG 
(2) INFORMATION FOR SEQ ID NO:10: 
m SEQUENCE CHARACTERISTICS : 
U) IaTlENGTH: 28 base pairs 
(B) TYPE: nucleic acid 
(C STRANDEDNESS: single 
(D) TOPOLOGY: linear 

MOLECULE TYPE: DNA (genomic) 
(v) FRAGMENT TYPE: C- terminal 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: E.coli 
<B) STRAIN: K12 

(vii) IMMEDIATE SOURCE: 
(B-) CLONE: MSA19 

(viii) POSITION IN GENOME: 

(B) MAP POSITION: lacZ 

,'xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GGAAGCTTAT TTTTGACACC AGACCAAC 

28 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA to mRNA 
(v) FRAGMENT TYPE: N- terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zea maize 

(vii) IMMEDIATE SOURCE : 

(B) CLONE: Lc5 

(viii) POSITION IN GENOME : 

(B) MAP POSITION: 5' end of color control gene Lc 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 
GTGATGGATC CTTCAGCTTC CCGAGTTCAG CAGGCGG 3? 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iv) ANTI- SENSE: YES 
(v) FRAGMENT TYPE: C- terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zea maize 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Lc3 

(viii) POSITION IN GENOME: 

(B) MAP POSITION: 3< end of color control gene Lc 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
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GGTCTCGAGC GAAGCTTCCC TATAGCTTTG CGAAGAG 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: ^. c 

(A) ORGANISM: Thermus aquaticus 

(B) STRAIN: YT1 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: KT2 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:13: 
GAGCCATGGC CAACCTGTGG GGGAGGCTTG AGGGGGA 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

K (A) ORGANISM: Thermus aquaticus 

(B) STRAIN: YT1 
(Vii> "JST^fSSSU Accession no. a0«39 

(Viii) DNA Phrase gene 

(B) MAP POSITION: 950 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 
AGTTTGGCAG CCTCCTCCAC GAGTTCGGCC TTCTGG 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear , 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Thermus aquaticus 

(B) STRAIN: YT1 

(vii) IMMEDIATE SOURCE: 
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(B) CLONE: Genbank Accession No. J04639 
(viii) POSITION IN GENOME : 

(A) CHROMOSOME/ SEGMENT : DNA polymerase gene 
<B) MAP POSITION: 2595 * 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GGACTGGCTC TCCGCCAAGG AGTGATACCA CC 

32 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: ' 

(A) ORGANISM: Thermus flavis 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Genbank Accession No. X66105 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/ SEGMENT : DNA polymerase 

(B) MAP POSITION: 1378 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 
AGTTTGGAAG CCTCCTCCAC GAGTTCGGCC TCCTGG 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Thermus flavis 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Genbank Accession No. X66105 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME /SEGMENT: DNA polymerase aene 

(B) MAP POSITION: 3023 ^ 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

r 

GGACTGGCTC TCCGCCAAGG AGTAGGGGGG TCCTG 35 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI- SENSE: YES 
(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: , . 

(A) ORGANISM: Bacillus thurmgiensis 

(B) STRAIN: CryV 

(C) INDIVIDUAL ISOLATE: NRD12 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: BtV3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GCGAAGCTTC TCGAGTTACG CTCAATATGG AGTTGCTTC 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 43 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI -SENSE: NO 
(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: , . 

(A) ORGANISM: Bacillus thurmgiensis 

(B) STRAIN: NRD12 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: BtVS 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 19: 
CCGAGATCTC CATGGATCCA AAGAATCAAG ATAAGCATCA AAG 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(B) STRAIN: PaPa 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: L36 

(viii) POSITION IN GENOME: 

(B) MAP POSITION: left end 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GGGCGGCGAC CTCGCGGGOT TTCGCTATTT ATGAAA 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

/ 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI- SENSE: YES 
(v) FRAGMENT TYPE: N-terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: E.coli 
<B) STRAIN: K12 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: lacZ'533 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 21: 
CGACGGCCAG TGAATCCGTA ATCATGGTCA TAG 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(v) FRAGMENT TYPE: C- terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: E.coli 

(B) STRAIN: K12 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: lac2333 

(viii) POSITION IN GENOME: 

(B) MAP POSITION: lacZ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
ACCAGCCATC GCCATCTGCT GCACGCGGAA GAA 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iv) ANTI- SENSE: NO 
(v) FRAGMENT TYPE: N- terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: E.COli 

(B) STRAIN: K12 

(vii) IMMEDIATE SOURCE : 

(B) CLONE: lacZ536 

(vUi) POSITION IN GENOME': 

(B) MAP POSITION: lacZ 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:23: 
CTATGACCAT GATTACGGAT TCACTGGCCG TCGTTT 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 3 

(A) ORGANISM: bacteriophage lambda 

(B) STRAIN: PaPa 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: MBL002 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
GCAAGACTCT ATGAGAAGCA GATAAGCGAT AAG 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Vi) ^f^^cteriophage lantoda 
(B) STRAIN: PaPa 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: MBL101 

(viii) POSITION IN GENOME: ' 
(B) MAP POSITION: 27840 

( Xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
ATCATTATTT GATTTCAATT TTGTCCCACT CCC 
(2) INFORMATION FOR SEQ ID NO: 26: 



36 



33 



33 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: bacteriophage lambda 

(B) STRAIN: PaPa 

/ 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: MBR001 

(viii) POSITION IN GENOME: 

(B) MAP POSITION: 34576 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
GGAGAGAACC ATGTATATGA TCGTTATCTG GGT 
(2) INFORMATION FOR SEQ ID NO: 27: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: bacteriophage lambda 

(B) STRAIN: PaPa 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: MBR202 

(viii) POSITION IN GENOME: 

(B) MAP POSITION: 34793 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
GCGCACAAAA CCATAGATTG CTCTTCTGTA AGG 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(v) FRAGMENT TYPE: C- terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: E.coli 

(B) STRAIN: K12 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: MSA1933 
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(viii) POSITION IN GENOME: 

(B) MAP POSITION: lacZ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
CCCGGTTATT ATTATTTTTG ACACCAGACC AAC 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bafce pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 1am>v * a 

(A) ORGANISM: bacteriophage lambda 

(B) STRAIN: PaPa 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: R36 

(viii) POSITION IN GENOME: 

(B) MAP POSITION: right end 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 29: 
AGGTCGCCGC CCCGTAACCT GTCGGATCAC CGGAAA 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: phi 29 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME / SEGMENT : DNA polymerase 

(B) MAP POSITION: ending at 435 



33 



36 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Lys Leu Met Leu Asn Ser Leu Tyr Gly Lys Phe Ala Ser Asn Pro Asp 

val Thr Gly Lys Val Pro Tyr Leu Lys Olu Asn Gly Ala Leu Gly Phe 

20 ' 25 

Arg Leu Gly Glu Glu Glu Thr Lys Asp Pro Val Tyr Thr Pro Met Gly 



35 40 



Val Phe He Thr Ala 
50 



(2) INFORMATION FOR SEQ ID NO: 31: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pyrococcus furiosus 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME /SEGMENT: UNA polymerase 

(B) MAP POSITION: ending at 516 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Asp Tyr Arg Gin Lys Ala lie Lys Leu Leu Ala Asn Ser Phe Tyr Glv 
5 .10 



15 



Tyr Tyr Gly Tyr Ala Lys Ala Arg Trp Tyr Cys Lys Glu Cys Ala Glu 
™ 25 30 

Ser Val Thr Ala 
35 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: phi 29 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME /SEGMENT : DNA polymerase 

(B) MAP POSITION: ending at 485 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

Trp Ala Arg Tyr Thr Thr He Thr Ala Ala Gin Ala Cys Tyr Asp Arg 

He He Tyr Cys Asp Thr Asp Ser He His Leu Thr Gly Thr Glu He 

20 25 30 

Pro Asp Val He Lys Asp He Val Asp Pro Lys Lys Leu Gly Tyr Trp 

35 40 AC ^ 



Ala His 
50 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLE COLE TYPE : protein 
(v) FRAGMENT TYPE: internal 



^f^Sf^yrococcus furiosus 

(B) MAP POSITION: ending at 576 

} 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

^ ti 0 nn Leu Val Trp Lys Glu Leu Glu Glu Lys 
Trp Gly Arg LyG Tyr He Glu Leu vax i^y y lg 



1 5 



Phe Gly Phe Lys Val Leu Tyr lie Asp Tnr Asp Gly Leu Tyr Ala Thr 

20 2b 
Ue Pro Gly Gly Glu Ser Glu Glu lie Lys Ly S Lys Ala Leu Glu Phe 

35 40 
val Lys Tyr lie Asn Ser Lys Leu Pro Gly Leu Leu 
50 55 
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WHAT IS CLAIMED IS: 

1. A recombinant DKA sequence encoding a DNA polymerase 
having an amino acid sequence comprising substantially the same amino 
acid sequence as Thermus aquaticus DNA polymerase, but lacking the 
N-terminal 280 amino acid residues of said DKA polymerase. 

2. A DNA polymerase having an amino acid sequence 
comprising substantially the same amino acid sequence as that of 
ThermUS a ^t<riiB DNA polymerase, excluding the N-terminal 280 amino 
acid residues of Thermus aouaticus DNA polymerase. 

3. A DNA polymerase as set forth in claim 2, having the 
amino acid sequence of SEQ ID NO: 6. 

4. A DNA polymerase as set forth in claim 2 that is 
encoded by a DNA sequence contained in plasmid pWB254b. 

5. A DNA polymerase having an amino acid sequence 
comprised substantially of amino acids 280-831 of the DNA polymerase of 
Thermus flavyc 

6. A formulation of thermostable DNA polymerase comprising 
a majority component comprised of at least one thermostable DNA 
polymerase lacking 3 ' -exonuclease activity and a minority component 
comprised of at least one thermostable DNA polymerase exhibiting 3 '- 
exonuclease activity. 

7. A formulation of thermostable DNA polymerase as set 
forth in claim 6 wherein the at least one thermostable DNA polymerase 
lacking 3 ' -exonuclease activity is Klentaq-278. 

8. A formulation of thermostable DNA polymerase as set 
forth in claim 7 wherein the at least one thermostable DNA polymerase 
exhibiting 3 ' -exonuclease activity is selected from the group 
consisting of Pfu DNA polymerase from Pvroco^,* the Tli DNA 
polymerase from Thermo coccus lirnraliR, or a variant of the Pfu DNA 
polymerase or the Tli DNA polymerase wherein the DNA. polymerase 
activity of said DNA polymerase has been diminished or inactivated. 

9 . A formulation of thermostable DNA polymerases as set 
forth in claim 6 wherein the majority component and minority component 
of the formulation are present in a ratio of from about 10 units to 
about 2000 units of the majority component to 1 unit of the minority 
component . 

10. A formulation of thermostable DKA polymerase as set 
forth in claim 9 wherein the majority component and minority component 
of the formulation are present in a ratio of from about 100 to about 
600 units of the majority component to l unit of the minority 
component. 

11. A formulation of DNA polymerase comprising at least 
one DNA polymerase which in wild-type form exhibits 3' -exonuclease 
activity and which is capable of catalyzing a temperature cycle type 
polymerase chain reaction wherein said 3' -exonuclease activity of said 
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- been reduced to between about 0.2% and 
ipflGt one DNA polymerase has peen tenure** 
lut l ot the 3'-elnuclease activity of said at least one DNA 
nolvmerase in itB wild-type form. 

12 A formulation of DNA polymerase comprising El. wherein 
El iB one or more DMA polymerases which lack any significant 3'- 
exonuclease activity, and E2. wherein E2 is one or more DNA polymerases 
which exhibit significant 3- -exonuclease activity and wherein the 
ratio of the amounts pt El to E2 is at least about 4 : 1 by DNA 
polymerase units or (b, at least about 4:1 by amounts of protein by 

WSi9ht ' 13 A formulation of DNA polymerase as set forth in claim 
12 wherein E2 is selected from the group consisting of DNA polymerase 
enccld by genes from Pyococcus furiosus. Thermococcus literalis . 
Thennococcus species GB-D. T7 coliphage, Thermotoga maritima or a 
Tomhination thereof, and wherein » is selected from the group 
consisting of a mutant. 3 • -exonuclease negative form of E2 or of DNA 
^ ses which, in unmutated form, do not exhibit 
LUleaee activity, such as DNA polymerase encoded by genes from 

^ • 1C Thprmue flavus, or Thermus thermophilus. 
Thermus amicus. Q £ qua. polymerase as set forth in claim 

12 wherein E2 comprises Pfu DNA polymerase from Pyococcus furiosus and 

the unit ratio of El to E2 is between about 150 to 170:1. 

the unit ^ f omulation o£ DNA polymerases as set forth m claim 

12 comprising a unit ratio of from about ISO to about 170:1 of Klenta*- 
278 to Pyrococcus furiosus DNA polymerase. 

to Pyr ^ fonnulation of DNA polymerases as set forth in claim 

12 arising a unit ratio of from about 150 to about 170:1 of Klentaq- 
291 to Pyrococcus furiosus DNA polymerase. _ 

17 A formulation of DNA polymerases as set forth in 
claim 12 wherein E2 comprises the DNA polymerase from ^°««=~ B 
species GB-D and the unit ratio of El to E2 is between about 450 to 

500 ' is A formulation of DNA polymerase as set forth in claim 

12 comprising a unit ratio of from about 10 to about 15:1 - 

or nelly fJl-length Thermus aguaticus DNA polymerase to Pyrococcus 

furiosus DNA polymerase . 

19 . a formulation of DNA polymerase as set forth in claim 

12 wherein El comprises a reverse transcriptase. 

20 * formulation of DNA polymerase as set forth m claim 
12 wherein El comprises a mutant or chemical modification of T7 or T3 
» Tolymeras e and E2 comprises a wild-type T7 or T3 

21 A formulation of DNA polymerase as set forth in claim 
12 wherein El comprises Thermus flavus or Thermus thermophilus DNA 
polymerase. ^ ^ £onmilation of dna polymerase as set forth in claim 
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12 wherein El comprises a Thermococcus literals DNA polymerase variant 
lacking any significant 3- -exonuclease activity and E2 comprises wiid- 
type Thermococcus literalis DNA polymerase. 

23. In a method for amplifying nucleic acid sequences by 
polymerase chain reaction, of the repeating cycle type wherein two 
complementary strands, for each different specific sequence being 
amplified, are treated with a single, a pair, or a mixture of pairs of 
oligonucleotide primers, and a DNA polymerase or a mixture of DNA 
polymerases is used to catalyze synthesis of an extension product of 
each primer which is complementary to each nucleic acid strand under 
effective conditions for said synthesis, and wherein said primers are 
selected so as to be sufficiently complementary to different strands of 
each specific sequence to hybridize therewith such that the extension 
product synthesized from one primer, when it is separated from its 
complement, can serve as a template for synthesis of the complementary 
strand of the extension product by extending the same or another 
included primer, the primer extension products are separated from the 
templates on which they were synthesized to produce single -stranded 
molecules, and the single-stranded molecules thus generated are treated 
with the primers and a DNA polymerase or a mixture of DNA polymerases 
is used to catalyze synthesis of an extension product of each primer 
under effective conditions for the synthesis of . primer extension 
product, wherein the improvement comprises formulating the DNA 
polymerase as set forth in claim 6, and catalyzing primer extension 
with said formulation of DNA polymerase. 

24 . In a method for amplifying nucleic acid sequences by 
polymerase chain reaction, of the repeating cycle type wherein two 
complementary strands, for each different specific sequence being 
amplified, are treated with a single, a pair, or a mixture of pairs of 
oligonucleotide primers, and a DNA polymerase or a mixture of DNA 
polymerases is used to catalyze synthesis of an extension product of 
each primer which is complementary to each nucleic acid strand under 
effective conditions for said synthesis, and wherein said primers are 
selected so as to be sufficiently complementary to different strands of 
each specific sequence to hybridize therewith such that the extension 
product synthesized from one primer, when it is separated from its 
complement, can serve as a template for synthesis of the complementary 
strand of the extension product by extending the same or another 
included primer, the primer extension products are separated from the 
templates on which they were synthesized to produce single -stranded 
molecules, and the single -stranded molecules thus generated are treated 
with the primers and a DNA polymerase or a mixture of DNA polymerases 
is used to catalyze synthesis of an extension product of each primer 
under effective conditions for the synthesis of a primer extension 
product, wherein the improvement comprises formulating the DNA 
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polymerase as set forth in claim 7, and catalyzing primer extension 
with said formulation of DNA polymerase. 

25 A method as set forth in claim 24 wherein one or more 
of the primers utilized in any repeating cycle is itself a product of 

PCR amplification. 

26 A method as set forth in claim 24 further comprising a 
denaturation step in/ each repeating cycle wherein the denaturation step 
has a duration of less than about 20 seconds in the reaction mixture. 

27 A method as set forth in *aim 26 wherein the 
denaturation step has a duration of less than about 5 seconds in the 

reaction mixture. 

28 in a method for amplifying nucleic acid sequences by 
polymerase chain reaction, of the repeating cycle type wherein two 
complementary strands, for each different specific sequence being 
emptied, are treated with a single, a pair, or a mixture of pairs of 
Xonucleotide primers, and a DNA polymerase or a mixture of DNA 
polymerases is used to catalyze synthesis of an extension product of 
^primer which is complementary to each nucleic acid strand under 
effective conditions for said synthesis, and wherein said primers are 
selected so as to be sufficiently complementary to different strands of 
each specif * -quence to hybridize therewith such that the extension 
product synthesized from one primer, when it is separated from its 
complement can serve as a template for synthesis of the complementary 
strand of the extension product by extending the same or another 
included primer, the primer extension products are separated from the 
templates on which they were synthesized to produce single-stranded 
macules, and the single -stranded molecules thus generated are treated 
Til the primers and a DNA polymerase or a mixture of DNA polymerases 

I used to catalyze synthesis of an extension product of each primer 
under effective conditions for the synthesis of a primer extension 
^duct. wherein the improvement comprises formulating a DNA polymerase 
comprising at least one DNA polymerase, which in wild-type form 

IXits 3<-exonud ctivity and which is capable of catalyzing a 

temperature cycle type polymerase chain reaction, wherein said - 
Nuclease activity of said at least one DNA polymerase of 
reduced to no greater than about 7* of the 3 ' ^^l^^ 
said at least one DNA polymerase in its wild- type form but not 
"iminated. and catalyzing primer extension with said' formulation of 
DNA polymerase. 
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Thf 1 !^ 1 ^" T ^ e P rimer at the S' side of the target fraoment 
The start codon is indicated with ***. 9 rragment. 

GAGCCATGGGCCTCCTCCACGAGTTCGGCCTTCTGG 36MER (SEQ 10 N0: D 

*** I 1 1 1 1 ■ 1 1 1 1 1 I 1 1 1 1 M 1 1-| 1 1 1 1 1 | 
aa m 9 L L " E F G L L E ... <- upper case ^ m 

ag 27S 280 282 284' 286 268 <-- codon numbering for WT 
AGTTTGGCAGCCTCCTCCACGAGTTCGGCCTTCTGG . . . (SEQ ID NO: 14) 

re^onT" J ° 4639 includes^on^ranlXS ^ 

^S'he^.^" «*> stranrof^ra^ualTri.ner 

—other strand— KLENTAQ32 35mer (SEQ ID NO: 3) 

Hindlll 
26 16 ****** g 

GGACTGGCTCTCCGCCAAGGAGTAGTAAGCTTCGC 

"IMMIIIIIMIIHliiii II 

DWL-SAKE* 
826 828 830 832 

GGACTGGCTCTCCGCCAAGGAGTGATACCACC (SEQ ID NO* 15) 

2604 2614 2624 

TaqPol.seq 

Figure 1 - p rirae rs that amplify the gene for KlenTaq-278 
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GAGCCATGGGCCTCCTCCACGAGTTCGGCCTTCTGG KT1 36MER (SEQ ID NO: 1) 
T l g "l"l U H 'e'f" "l i B ... <" upper case are WT 
Sa 278 280 282 284 286 288 <— codon numbering for WT 
^TTO^CCTCCTCC.^TTCOJCCTCC™ ^J**" ^'xSf 

1387 (numbering includes 5' non-translated 

region) 

16 6 —other strand— 

GGACTGGCTCTCCGCCAAGGAGTAGTAAGCTTCGC KLENTAQ32 35mer 

1111111111111111111111111 1 1 1 



DWLSAKE* 



GGACTGGCTCTCCGCCAAGGAGTAGGGGGGtCCTG Tfl.seq (SEQ ID NO: 17) 

3032 3042 3052 

Figure 2 - The same primers are homologous to Thermus flavus 



DNA 
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2 min. at indicated 99°, 98° or 97<\10 min. 65<> per cycle, 20 cycles. 
AmpliTaq' KlenTaql 

... reaction. 



9 ?° -" 8 - ^ 1/8 1/16 <- ul per 1 00 ul 




98 



<- 2 KB product size 



AmpliTaq KlenTaql 
■8 .4 2. 1/4 1/8 1/16 <-ul per 100 ul reaction. 




97< 



<- 2 KB product size 



AmpliTaq KlenTaql 
.8 .4 .2 1/4 1/8 1/16 <-ul per 100 ul reaction. 



<- 2 KB product size 




Figure 3. KlenTaql is more thermostable than AmpliTaq. 
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A oqo r VR qs° C for. four enzymes catalyzing PCR 
Figure 4 - ^J^.^; S T, KlenTaq-291 



f im ,ra. AA- 2 min. 98' C, 10 min. 65° C, 20 cycles 
BotSE fi^re Si 2 Si 95- C 10 min. 65- C, 20 cycles 



XH3 
std 



AT 



KT-278 



ST 



KT-291 AH3 
std 



3/4 3/8 1/4 1/8 3 1.5 1/4 1/8 




^y^v . j^jjgj 
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Figure 5 ~ Colony PCR benefits from 98° C 

PCR peak 2' : 95° 98° 

Pretreatment : 100° none 100° none 
Lane: 12 3 4 



WO 94/26766 



6/13 



PCT/US94/01867 



Figure 6A,B,C — Large DNA span 6.6 kb PCR 

For all of these gels, the standard lane is a Hindlll digest 
of lambda phage DNA, and the 3d band in this digest is a DNA 
fragment of length 6557 bp. 



5 Figure 6A 

ul of KlenTaq-278 
ul oi Pfu DNA Pol 



6.6 KB — > 



20 cycles of 2 min. 94°, 2 min. 60° , 7 min. 72 



1,1 1 1 0 2 2 2 2 
1/2 1/4 1/8 0 1/2 1/2 1/4 1/8 0 



XH3 




New Art 



Prior 
Art 



New Art 



Prior 
Art 



See next page for Figure 6B and 6C. 
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Figure 6B 



20 cycles of 2 min. 94* , 2 min. 60«, io min. 72 



ul of KlenTaq-278 1 1 1 1 i l o 
Ul of Pfu DNA Pol/ 1/2 1/4 1/8 1/16 1/32 0 1/2 




< — 6.6 KB 



Figure 6C 



20 cycles of 2 min. 95- , l min. 60' , 30 min. 72 



ul of KlenTaq-278 
ul of Pfu DNA Pol 

AH3 



111 11 
1 1/4 1/16 1/64 0 
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Figure 7. Amplification of 8.4 to 18 kb by Klentaq-LA 



Channel : 
Template DNA: 
ng of template: 1 
primer 1 SEQ 10 NO: 9 
primer 2 SEQ 10 NO: 8 
Size of PCR product 
Expected, in kb: 8.4 



JL P lac5 XpcJtT XpcJtT XplacS XplacS XK138 XK138 XK138 



10 
7 
8 



1 
10 
8 



1Z.5 12.5 15 



10 
10 
8 

15 



10 
7 
8 



10 
9 
8 



Conditions ft 



18 18 19.7 
Conditions 8 



XH3 1 




Conditions C 




PCR Cycling Conditions A: 2 seconds 94;, 11 minutes 70-. 20 cycles 

S 835 S3SS2 ci i SSS « 3SS £: To SS: 

XK138 is fSw-vectored clone of E, g»H DNA spanning the 1st operon. 
XH3 size standards visible are 23130, 9416, 6557, 4361, 2322, and 2027 bp. 
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Figure 9 . 
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Figure 10 



Lane: 
target size, kb: 
primer length: 
template/100 ul: 
AH3 



1 2 3 4 5 
12 15 18 12 15 

27 f — — 33- 

2 ng 



6 
18 



7 
12 



8 

15 
•27- 



9 
18 



10 
12 



11 
15 



12 
18 



10 ng 




AH3 



W0 94O6766 

Figure 11 
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2 2.3 



4.4 6.6 9.4 23 49 



Lane Size 

1 S 

2 26194 

3 20707 

4 19.8 

5 15560 

6 S 

7 22495 

8 22.0 

9 24.6 

10 S 

11 12.1 

12 5.8 

13 8.7 

14 18.0 

15 S 



T 
O 
P 
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5 28 35 S 
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7731 



447 



Figure 12 



Figure 13 
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